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ABSTRACT 

Thfs  paper  describes  a  system  for  the  computer 
understanding  of  English.  The  system  answers  questions/ 
executes  commands/  and  accepts  information  in  normal  English 
dialog.  It  uses  semantic  information  and  context  to  understand 
discourse  and  to  disambiguate  sentences.  It  combines  a  complete 
syntactic  analysis  of  each  sentence  with  a  "heuristic 
understander"  which  uses  different  kinds  of  information  about  a 
sentence/  other  parts  of  the  discourse/  and  general  information 
about  the  world  in  deciding  what  the  sentence  means. 

It  Is  based  on  the  belief  that  a  computer  cannot  deal 
reasonably  with  language  unless  It  can  "understand”  the  subject 
it  Is  discussing.  The  program  Is  given  a’  detailed  model  of  the 
knowledge  needed  by  a  simple  robot  having  only  a  hand  and  an 
eye.  We  can  give  it  instructions  to  manipulate  toy  objects/ 
interrogate  it  about  the  scene/  and  give  it-  Information  It  will 
use  In  deduction.  In  addition  to  knowing  the  properties  of  toy 
objects/  the  program  has  a  simple  model  of  Its  own  mentality. 

It  can  remember  and  discuss  Its  plans  and  actions  as  well  as 
carry  them  out.  It  enters  into  a  dialog  with  a  person/ 
responding  to  English  sentences  with  actions  and  English 
replies/  and  asking  for  clarification  when  Its  heuristic 
programs  cannot  understand  a  sentence  through  use  of  context  and 
physical  knowledge. 

In  the  programs/  syntax/  semantics  and  Inference  are 
Integrated  in  a  "vertical"  system  in  which  each  part  Is 
constantly  communicating  with  the  others.  V/e  nave  explored 
several  techniques  for  Integrating  the  large  bodies  of  complex 
knowledge  needed  to  understand  language.  We  use  Systemic 
Grammar/  a  type  of  syntactic  analysis  which  is  designed  to  deal 
with  semantics.  Rather  than  concentrating  on  the  exact  form  of 
rules  for  the  shapes  of  linguistic  constituents/  It  Is 
structured  around  choices  for  conveying  meaning.  It  abstracts 
the  relevant  features  of  the  linguistic  structures  which  are 
important  for  Interpreting  their  meaning. 

We  represent  many  kinds  of  knowledge  In  the  form  of 
procedures  rather  than  tables  of  rules  or  lists  of  patterns.  By 
developing  special  procedural  languages  for  grammar/  semantics/ 
and  deductive  logiC/  we  gain  the  flexibility  and  power  of 
programming  languages  while  retaining  the  regularity  and 
understandab 1 1  I ty  of  simpler  rule  forms.  Each  piece  of 
knowledge  can  be  a  procedure/  and  can  call  on  any  other  piece  of 
knowledge  In  the  system. 

Thesis  Supervisor:  Seymour  A.  Papert/  Professor  of  Applied 
Mathemat I cs 
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Note  on  the  Organization  of  the  Text 

This  paper  was  written  to  be  readable  at  several  different 
levels  of  detail.  The  Preface  is  intended  to  be  understandable  * 

to  a  layman  with  no  special  knowle<lga  of  linguistics  or 
computers,  and  gives  a  general  idea  of  the  purposes  and  methods. 

The  Introduction  gives  somewhat  more  detail,  along  with  a  sample 
of  a  dialog  with  the  program.  It  explains  more  specifically  how 
the  program  is  organized,  and  what  theories  were  used  in  its  • 

construct! on . 

The  remaining  chapters  each  contain  a  general  introductory 
section,  followed  by  further  sections  explaining  the  details  of 
the  programs  and  theories.  It  should  be  possible  to  get  a  good 
basic  understanding  of  the  paper  by  reading  the  introduction, 
followed  by  the  first  section  of  each  chapter.  In  addition, 
there  was  an  attempt  to  keep  sections  independent  so  that  parts 
of  the  paper  could  be  selected  separately.  For  example,  the 
description  of  the  PROHRAMMAR  language,  and  the  description  of 
our  grammar  of  English  can  be  read  independently. 

Because  of  this  structure,  some  parts  of  the  paper  are 
redundant  --  a  particularly  important  principle  or  relevant 
example  may  be  repeated  at  all  three  levels  of  explanation,  and 
in  the  different  sections  where  it  is  needed.  However  it  is 
hoped  that  it  will  allow  the  reader  to  go  into  the  content  as 
deeply  as  he  wants  without  getting  bogged  down  in  detail. 

Throughout  the  text,  single-spaced  paragraphs  are  used  for 
emphas i s . 
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Preface  --  Talking  to  Computers 

Computers  are  being  used  today  to  take  over  many  of  our 
jobs.  They  can  perform  millions  of  calculations  in  a  second^ 
handle  mountains  of  data,  and  perform  routine  office  work  much 
more  efficiently  and  accurately  than  humans.  But  when  it  comes 
to  telling  them  what  to  do,  they  are  tyrants.  They  insist  on 
being  spoken  to  in  special  computer  languages,  and  act  as  though 
they  can't  even  understand  a  simple  English  sentence. 

Let  us  envision  a  new  way  of  using  computers  so  they  can 
take  instructions  in  a  way  suited  to  their  Jobs.  We  will  talk 
to  them  just  as  we  talk  to  a  research  assistant,  librarian,  or 
secretary,  and  they  will  carry  out  our  commands  and  provide  us 
with  the  information  we  ask  for.  If  our  instructions  aren't 
clear  enough,  they  will  ask  for  more  information  before  they  do 
what  we  want,  and  this  dialog  will  all  be  in  English. 

Why  isn't  this  being  done  now?  Aren't  computers 
translating  foreign  languages  and  conducting  psychiatric 
interviews?  Surely  it  must  be  easier  to  understand  simple 
requests  for  information  than  to  understand  Russian  or  a 
person's  psychological  problems.  The  key  to  this  question  Is  in 
understanding  what  we  mean  by  "understanding".  Computers  are- 
very  adept  at  manipulating  symbols  --  at  shuffling  around 
strings  of  letters  and  words,  looking  them  up  in  dictionaries, 
and  rearranging  them.  In  the  early  days  of  computing,  some 
people  thought  that  simple  applications  of  these  capabilities 
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might  be  just  what  was  needed  to  translate  languages.  The 
government  supported  a  tremendous  amount  of  research  irito  « 

language  translation,  and  a  number  of  projects  tried  different 
approaches.  In  1066  a  committee  of  the  National  Academy  of 
Sciences  wrote  a  report  evaluating  this  research  and  announced 
sadly  that  it  had  been  a  failure.  Every  project  ran  up  against 
the  same  brick  wall  --  the  computer  didn't  know  what  it  was 
talking  about. 

V/hen  a  human  reader  sees  a  sentence,  he  uses  knowledge  to 
understand  it.  This  includes  not  only  grammar,  but  also  his 
knowledge  about  words,  the  context  of  the  sentence,  and  most 
important,  his  knowledge  about  the  subject  matter.  A  computer 
program  supplied  with  only  a  grammar  for  manipulating  the  syntax 
of  language  could  not  produce  a  translation  of  reasonable 
qua  1 i ty . 

Everyone  has  heard  the  story  of  the  computer  that  tried  to 
translate  "The  spirit  Is  willing  but  the  flesh  is  weak."  into 
Russian  and  came  out  with  something  which  meant  "The  vodka  is 
strong  but  the  meat  is  rotten."  Unfoi'tunate  I  y  the  problem  is 
much  more  serious  than  just  choosing  the  wrong  words  when 
translating  idioms.  It  Isn't  always  possible  to  even  choose  the 
right  grammatical  forms.  We  may  want  to  translate  the  tv/o 
sentences  "A  message  was  delivered  by  the  next  visitor."  and  "A 
message  was  delivered  by  tne  next  day."  If  we  are  translating  * 

into  a  language  which  doesn't  have  the  equivalent  of  our 
"passive  voice",  we  may  need  to  completely  rearrange  the  first  ^ 

sentence  into  something  corresponding  to  "The  next  visitor 
delivered  a  message."  The  other  sentence  might  become  something 
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n ke  "Before  the  next  day,  someone  delivered  a  message."  If  the 
computer  picks  the  wrong  form  for  either  sentence,  the  meaning 
Is  totally  garbled.  In  order  to  make  the  choice.  It  has  to  know 
that  visitors  are  people  who  can  deliver  messages,  while  days 
are  units  of  time  and  cannot.  It  has  to  "understand"  the 
meanings  of  the  words  "day"  and  "visitor". 

In  other  cases  the  problem  Is  even  worse.  Even  a  knowledge 
of  the  meanings  of  words  Is  not  enough.  Let  us  try  to  translate 
the  two  sentences; 

"The  city  councllmen  refused  to  give  the  women  a 
permit  for  a  demonstration  because  they  feared  violence." 
and 

"The  city  councllmen  refused  to  give  the  women  a 
permit  for  a  demonstration  because  they  advocated 
revol ut ion . " 

If  we  are  translating  Into  a  language  (like  French)  which 
has  different  forms  of  the  word  "they"  for  masculine  and 
feminine,  we  cannot  leave  the  reader  to  figure  out  who  "they" 
refers  to.  The  computer  must  make  a  choice  and  If  It  chooses 
wrong,  the  meaning  of  the  sentence  Is  changed.  To  make  the 
decision.  It  has  to  have  more  than  the  meanings  of  words.  It 
has  to  have  the  information  and  reasoning  power  to  realize  that 
city  councllmen  are  usually  staunch  advocates  of  law  and  order, 
but  are  hardly  likely  to  be  revolutionaries. 

For  some  uses,  it  Isn't  really  necessary  to  understand 
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much.  There  has  been  much  publicity  about  a  well  known 

"psycniatrl  St**  program  named  ELIZA.  It  imitates  the  kind  of 

Rogerlan  psychiatrist  who  would  respond  to  a  question  like  •*What 

time  is  It?**  by  asking  **Why  do  you  want  to  know  what  time  It 

Is?**  or  muttering  **You  vMant  to  know  what  time  ft  Is!*'.  This  can  ^ 

be  done  without  much  understanding.  All  it  needs  to  do  Is  take 

the  words  of  the  question  and  rearrange  them  in  some  simple  way 

to  make  a  new  question  or  statement.  In  addition  It  recognizes 

a  few  key  words^  to  respond  with  a  fixed  phrase  whenever  the 

patient  uses  one  of  them.  If  the  patient  types  a  sentence 

containing  the  word  ’’mother*’^  the  program  can  say  "Tell  me  more 

about  your  family!**.  In  fact/  this  Is  just  how  the  psychiatrist 

program  works.  But  very  often  it  doesn't  work  --  its  answers 

are  silly  or  meaningless  because  It  Isn't  really  understanding 

the  content  of  what  Is  being  said. 

If  we  really  want  computers  to  understand  us,  we  need  to 
give  them  the  ability  to  use  more  knowledge.  In  addition  to  a 
grammar  of  the  language^  they  need  to  have  all  sorts  of 
knowledge  about  the  subject  they  are  discussing/  and  they  have 
to  use  reasoning  to  combine  facts  In  the  right  way  to  understand 
a  sentence  and  respond  to  it.  The  process  of  understanding  a 
sentence  has  to  combine  grammar/  semantics/  and  reasoning  in  a 
very  Intimate  way/  calling  on  each  part  to  help  with  the  others. 

This  thesis  explores  one  way  of  giving  the  computer 
knowledge  In  a  flexible  and  usable  form.  In  addition  to  basic  ir 

tools  and  operations  for  understanding  language/  we  give  the 
computer  specialized  Information  about  the  English  language/  the 
words  we  will  use,  and  the  subject  we  will  discuss.  In  most 
earlier  computer  programs  for  understanding  language,  there  have 
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been  attempts  to  use  these  kinds  of  Information  In  the  form  of 
lists  of  ruleS/  patterns/  and  formulas. 

In  our  system,  knowledge  Is  expressed  as  programs  In 
special  languages  designed  for  syntax,  semantics,  and  reasoning. 
These  languages  have  the  control  structure  of  a  programming 
language,  with  the  statements  of  the  language  explicitly 
controlling  the  process.  This  makes  It  possible  to  relate  the 
different  areas  of  knowledge  more  directly  and  completely.  The 
course  of  the  understanding  process  can  be  determined  directly 
by  special  knowledge  about  a  word,  a  syntactic  construction,  or 
a  particular  fact  about  the  world. 

This  gives  greater  flexibility  than  a  program  with  a  fixed 
control  structure.  In  which  the  specific  knowledge  can  only 
Indirectly  control  the  process  of  understanding.  By  using 
languages  specially  developed  for  representing  these  kinds  of 
knowledge.  It  Is  possible  for  a  person  to  "teach"  the  computer 
what  It  needs  to  know  about  a  new  subject  or  a  new  vocabulary 
v/Ithout  being  concerned  with  the  details  of  how  the  computer 
will  go  about  using  the  knowledge  to  understand  language.  For 
simple  Information,  It  Is  even  possible  to  just  "tell"  the 
computer  In  English.  Other  systems  make  It  possible  to  "tell" 
the  computer  new  things  by  allowing  It  to  accept  only  very 
specialized  kinds  of  Information.  By  representing  Information 
as  programs,  we  can  greatly  expand  the  range  of  things  which  can 
be  Included. 

The  best  way  to  experiment  with  such  Ideas  Is  to  write  a 
worki.ng  program  which  can  actually  understand  language.  We 
would  like  a  program  which  can  answer  questions,  carry  out 
commands,  and  accept  new  Information  In  English.  If  we  really 
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much.  There  has  been  much  publicity  about  a  well  known 
"psychiatrist*  program  named  ELIZA.  It  imitates  the  kind  of 
Rogerian  psychiatrist  who  would  respond  to  a  question  like  "What 
time  is  it?"  by  asking  "Why  do  you  want  to  know  what  time  it 
is?"  or  muttering  "You  want  to  know  what  time  it  is!".  This  can 
be  done  without  much  understanding.  All  it  needs  to  do  Is  take 
the  words  of  the  question  and  rearrange  them  in  some  simple  way 
to  make  a  new  question  or  statement.  In  addition  it  recognizes 
a  few  key  words,  to  respond  with  a  fixed  phrase  whenever  the 
patient  uses  one  of  them.  If  the  patient  types  a  sentence 
containing  the  word  "mother",  the  program  can  say  "Tell  me  more 
about  your  family!".  In  fact,  this  is  just  hew  the  psychiatrist 
program  works.  But  very  often  it  doesn't  work  its  answers 
are  silly  or  meaningless  because  it  Isn't  really  understanding 
the  content  of  what  is  being  said. 

If  we  really  want  computers  to  understand  us,  we  need  to 
give  them  the  ability  to  use  more  knowledge.  In  addition  to  a 
grammar  of  the  language,  they  need  to  have  all  sorts  of 
knowledge  about  the  subject  they  are  discussing,  and  they  have 
to  use  reasoning  to  combine  facts  in  the  right  way  to  understand 
a  sentence  and  respond  to  it.  The  process  of  understanding  a 
sentence  has  to  combine  grammar,  semantics,  and  reasoning  in  a 
very  intimate  way,  calling  on  each  part  to  help  with  the  others. 

This  thesis  explores  one  way  of  giving  the  computer 
knowledge  In  a  flexible  and  usable  form.  In  addition  to  basic 
tools  and  operations  for  understanding  language,  we  give  the 
computer  specialized  information  about  the  English  language,  the 
words  we  will  use,  and  the  subject  we  will  discuss.  In  most 
earlier  computer  programs  for  understanding  language,  there  have 
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been  attempts  to  use  these  kinds  of  Information  In  the  form  or’ 

*  lists  of  ruleS/  patterns,  and  formulas. 

In  our  system,  knowledge  Is  expressed  as  programs  In 
special  languages  designed  for  syntax,  semantics,  and  reasoning. 
These  languages  have  the  control  structure  of  a  programming 
^  language,  with  the  statements  of  the  language  explicitly 

controlling  the  process.  This  makes  It  possible  to  relate  the 
different  areas  of  knowledge  more  directly  and  completely.  The 
course  of  the  understanding  process  can  be  determined  directly 
by  special  knowledge  about  a  word,  a  syntactic  construction,  or 
a  particular  fact  about  the  world. 

This  gives  greater  flexibility  than  a  program  with  a  fixed 
control  structure.  In  which  the  specific  knowledge  can  only 
Indirectly  control  the  process  of  understanding.  By  using 
languages  specially  developed  for  representing  these  kinds  of 
knowledge.  It  Is  possible  for  a  person  to  "teach"  the  Computer 
what  It  needs  to  know  about  a  new  subject  or  a  new  vocabulary 
without  being  concerned  with  the  details  of  how  the  computer 
will  go  about  using  the  knowledge  to  understand  language.  For 
simple  Information,  It  Is  even  possible  to  just  "tell"  the 
computer  in  English.  Other  systems  make  It  possible  to  "tell" 
the  computer  new  things  by  allowing  It  to  accept  only  very 
specialized  kinds  of  Information.  By  representing  Information 
as  programs,  we  can  greatly  expand  the  range  of  things  which  can 

*  be  Included. 

The  best  way  to  experiment  with  such  Ideas  Is  to  write  a 
working  program  which  can  actually  understand  language.  We 
would  I  I ke  a  program  which  can  answer  questions,  carry  out 
commands,  and  accept  new  Information  In  English.  If  we  really 
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want  it  to  understand  language#  we  must  give  it  knowledge  about 
the  specific  subject  we  want  to  talk  about. 

For  our  experiment#  we  pretended  that  we  were  talking  to  a 
simple  robot#  with  a  hand  and  an  eye  and  the  ability  to 
manipulate  toy  blocks  on  a  table.  We  can  say#  "Pick  up  a  block 
which  is  bigger  than  the  one  you  are  holding  and  put  it  in  the 
box."#  or  ask  a  sequence  of  questions  like  "Had  you  touched  any 
pyramid  before  you  put  the  green  one  on  the  little  cube?"  "V'/hen 
did  you  pick  It  up?"  "Why?"#  or  we  can  give  it  new  information 
like  "I  like  blocks  which  are  not  red#  but  I  don't  like  anything 
which  supports  a  pyramid."  The  "robot"  responds  by  carrying  out 
the  commands  (in  a  simulated  scene  on  a  display  screen  attached 
to  the  computer)#  typing  out  answers  to  the  questions#  and 
accepting  the  Information  to  use  in  reasoning  later  on. 

The  dialog  is  carried  out  by  typing  on  a  terminal  attached 
to  the  computer  tine-sharing  system.  There  are  a  number  of  hard 
technical  problems  in  getting  a  computer  to  communicate  by 
voice#  and  it  has  not  been  attempted. 

We  had  three  main  kinds  of  goals  in  writing  such  a  program. 
The  first  is  the  practical  goal  of  having  a  language¬ 
understanding  system.  Even  though  we  used  the  robot  as  our  test 
area#  the  language  programs  do  not  depend  on  any  special  subject 
matter#  and  they  have  been  adapted  to  other  uses. 

The  second  goal  is  gaining  a  better  understanding  of  what 
language  is  and  how  It  is  put  together.  To  write  a  program  we 
need  to  make  all  of  our  knowledge  about  language  vary  explicit# 
and  we  have • to  be  concerned  with  the  entire  language  process# 
not  just  one  area  such  as  syntax.  We  need  tha  most  advanced 
theories  which  linguists  and  others  have  developed#  and  we  must 
fit  them  together  to  get  the  program  working.  This  provides  a 
rigid  test  for  linguistic  theories#  and  leads  us  into  making  nev/ 
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theories  to  fill  the  places  where  the  old  ones  are  lacking.  To 
make  a  comp'jter  understand  language^  we  have  to  Increase  our 
knowledge  of  how  people  understand  language,  and  one  Important 
goal  is  the  Improvement  of  our  comprehension  of  language  and 
1 Ingul sties. 

More  generally,  we  want  to  understand  what  Intelligence  Is 
and  how  it  can  be  put  Into  computers.  Language  is  one  of  the 
most  complex  and  unique  of  human  activities,  and  understanding 
Its  structure  may  lead  to  a  better  theory  of  how  our  minds  work. 
The  techniques  needed  to  write  a  language-understanding  program 
may  be  useful  in  many  other  areas  of  intelligence  such  as 
vision,  mathematical  problem  solving,  and  game  playing.  All  of 
these  areas  have  similar  problems  of  integrating  large  amounts 
of  knowledge  Into  a  flexible  system. 

With  advances  In  artificial  Intelligence  we  will  some  day 
be  able  to  combine  all  of  these  areas  to  provide  computers  which 
are  not  tyrants,  but  can  understand  what  we  tell  them  and  do 
what  we  want  them  to  In  an  Intelligent  way. 
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Chapter  I  —  Introduction 

1.1  General  Description  •. 

This  paper  describes  a  system  for  the  understanding  of 
English  by  a  computer.  The  system  answers  questions^  executes 

V. 

commands,  and  accepts  Information  In  normal  English  dialog.  It 
uses  semantic  Information  and  context  to  understand  discourse 
and  to  disambiguate  sentences  both  syntactically  and 
semantically.  It  combines  a  complete  syntactic  analysis  of  each 
sentence  with  a  "heuristic  understander"  which  uses  different 
kinds  of  Information  about  a  sentence,  other  parts  of  the 
discourse,  and  general  Information  about  the  world  In  deciding 
what  a  sentence  mea;is. 

i k  is  based  on  the  belief  that  a  computer  cannot  deal 
reasonably  with  language  unless  't  can  "understand"  the  subject 
It  Is  discussing.  The  program  is  given  a  detailed  model  of  a 
small  part  of  knowledge,  and  It  can  understand  sentences 
concerning  the  subject  It  knows  about.  We  use  as  an  example  the 
knowledge  needed  by  a  simple  robot  consisisting  of  only  an  eye 
and  a  hand.  We  can  give  It  Instructions  to  manipulate  toy 
objects.  Interrogate  It  about  the  scer.e,  and  give  It  Information 
It  will  use  In  deduction.  ^ 

In  addition  to  knowing  the  properties  of  blocks,  hands, 
etc.,  the  program  has  a  simple  model  of  Its  own  mentality.  It  ^ 

can  remember  and  discuss  Its  plans  and  actions  as  well  as  carry 
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them  out.  It  enters  Into  a  true  dialog  v/lth  a  person# 
responding  to  English  sentences  with  actions  and  English 

M 

replies#  and  asking  for  clarification  when  Its  heuristic 
programs  cannot  disambiguate  a  sentence  through  use  of  context 
4  or  physical  knowledge. 
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IjJLiuL  Is.  Language? 

To  write  a  computer  program  which  understands  natural 

language,  we  need  to  understand  what  language  Is  and  what  It 
does.  It  should  be  approached  not  as  a  set  of  mathematical 
rules  and  symbols,  but  as  a  system  Intended  to  communicate  Ideas 
from  a  speaker  to  a  hearer,  and  we  want  to  analyze  how  It 
achieves  that  communication.  It  can  be  viewed  as  a  process  of 
translation  from  a  structure  of  "concepts"  In  the  mind  of  the 
speaker.  Into  a  string  of  sounds  or  written  marks,  and  back  Into 
concepts  In  the  mind  of  the  hearer. 

In  order  to  talk  about  concepts,  w2  must  understand  the 

Importance  of  mental  models  f see  <MInsky  1965>).  In  the  flood 

of  data  pouring  Into  cur  brains  every  moment,  people  recognize 
regular  and  recurrent  patterns.  From  these  we  set  up  a  mo  'el  of 
the  world  which  serves  as  a  framework  In  which  to  organize  our 
thoughts.  We  abstract  the  presence  of  particular  objects, 
having  properties,  and  entering  Into  events  and  relationships. 

Our  thinking  Is  a  process  of  manipulating  the  "concepts"  which 

make  up  this  model.  Of  course,  there  Is  no  way  of  actually 

observing  the  Internal  workings  of  a  person's  mind,  but  In 

Section  3.1  we  will  discuss  the  justification  for  postulating  ^ 

such  a  "model"  In  analyzing  the  human  use  of  language.  In 

Section  3.4  we  show  what  this  model  might  look  like  for  a  small 

area  of  knowledge,  and  describe  how  It  can  be  used  for 

reasoning. 
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When  we  communicate  with  others/  we  select  concepts  and 
patterns  from  the  model  and  map  them  onto  patterns  of  sound, 
which  are  then  reinterpreted  by  the  hearer  In  terms  of  his  own 
model.  A  theory  can  concentrate  on  either  half  of  this  process 
of  generation  and  Interpretation  of  language.  Even  though  a 
complete  theory  must  account  for  both.  Its  approach  Is  strongly 
colored  by  which  one  It  views  as  logically  primary.  Most 
current  theories  are  "generative”,  but  It  seems  more  Interesting 
to  look  at  the  Interpretive  side  (see  <Wlnograd  1969>  for  a 
discussion  of  the  Issues  Involved).  The  first  task  a  child 
faces  Is  understanding  rather  than  producing  language,  and  he 
understands  many  utterances  before  he  can  speak  any.  At  every 
stage  of  development,  a  person  can  understand  a  much  wider  range 
of  patterns  than  he  produces  (see  <Mlller>,  Chapter  7).  A 
program  Is  not  a  detailed  psychological  tneoryof  how  a  person 
Interprets  language,  but  there  may  In  fact  be  very  Informative 
parallels,  and  at  a  high  level.  It  may  be  a  reasonable 
simulation. 

Language  understanding  Is  a  kind  of  Intellectual  activity. 
In  which  a  pattern  of  sounds  or  written  marks  Is  Interpreted 
Into  a  structure  of  concepts  In  the  mind  of  the  Interpreter.  We 
cannot  think  of  I t  as  being  done  In  simple  steps:  1.  Parse;  2. 
Understand  the  meaning;  3.  Think  about  the  meaning.  The  way  we 
parse  a  sentence  Is  controlled  by  a  continuing  semantic 
Interpretation  which  guides  us  In  a  "meaningful"  direction. 
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V«'hen  we  see  the  sentence  "He  gave  the  boy  plants  to  water," 
we  don't  get  tangled  up  In  an  I nterpreta^ t on  which  would  be 
parallel  to  "He  gave  the  house  plants  to  charity."  The  phrase 
"boy  plants"  doesn't  make  sense  like  "house  plants"  or  "boy 
scouts",  so  we  reject  any  parsing  which  would  use  It. 

Syntax,  semantics,  and  Inference  must  be  Integrated  In  a 
close  way,  so  that  they  can  share  In  the  responsibility  for 
Interpretation.  Our  program  must  Incorporate  the  flexibility 
needed  for  this  kind  of  "vertical"  system  In  which  each  part  Is 
constantly  talking  to  the  others.  We  have  explored  several 
techniques  for  Integrating  the  large  bodies  of  complex  knowledge 
needed  to  understand  language.  Two  are  particularly  Important. 

First,  we  use  a  type  of  syntactic  analysis  which  Is 
designed  to  deal  with  questions  of  semantics.  Rather  than 
concentrating  on  the  exact  form  of  rules  for  shuffling  around 
linguistic  symbols,  it  studies  the  way  language  Is  structured 
around  choices  for  conveying  meaning.  The  parsing  of  a  sentence 
Indicates  Its  detailed  structure,  but  more  important  it 
abstracts  the  "features"  of  the  linguistic  components  which  are 
Important  for  Interpreting  their  meaning.  The  syntactic  theory 
Includes  an  analysis  of  the  way  language  Is  structured  to  convey 
Information  through  systematic  choices  of  features.  The  other 
parts  of  the  program  can  look  directly  ct  these  relevant 
features,  rather  than  having  to  deal  with  minor  details  of  the 
way  the  pars«ng  tree  looks. 
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Second/  we  represent  knowledge  In  the  form  of  procedures 
rather  than  tables  of  rules  or  lists  of  patterns.  By  developing 
special  procedural  languages  for  grammar#  semantics#  and 
deductive  logic#  we  gain  the  flexibility  and  power  of  programs 
while  retaining  the  regularity  and  understandlbl 1  I ty  of  simpler 
rule  forms.  Since  each  piece  of  knowledge  can  be  a  procedure# 

It  can  call  on  any  other  piece  cf  knowledge  of  any  type.  The 
parser  can  call  semantic  routines  to  see  whether  the  line  of 
parsing  It  Is  following  makes  any  sense#  and  the  semantic 
routines  can  call  deductive  programs  to  see  whether  a  particular 
phrase  makes  sense  In  tie  current  context.  This  Is  particularly 
Important  In  handling  discourse#  where  the  Interpretation  of  a 
sentence  containing  such  things  as  pronouns  may  depend  In 
complex  -vays  on  the  preceding  discourse  and  knowledge  of  the 
sub  •!ct  matter. 

This  dual  view  of  programs  as  data  and  data  as  programs 
would  not  have  been  possible  In  traditional  programming 
languages.  The  special  languages  for  expressing  facts  about 
grammar#  semantics#  and  deduction  are  embedded  In  LISP#  and 
share  with  It  the  capability  of  Ignoring  the  artificial 
distinction  between  programs  and  data. 
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We  can  divide  the  process  of  language  understanding  Into 
three  main  areas  --  syntax#  semantics#  and  Inference.  As 
mentioned  above#  these  areas  cannot  be  viewed  separately  but 
must  be  understood  as  part  of  an  Integrated  system. 

Nevertheless#  we  have  organized  our  programs  along  these  baste 
lines#  since  each  area  has  Its  own  tools  and  concepts  which  make 
It  useful  to  write  special  programs  for  It. 

Listing  these  aspects  of  language  understanding  separately 
Is  somewhat  misleading#  as  It  Is  the  Interconnection  and 
Interplay  between  them  which  makes  the  system  possible.  Our 
parser  does  not  parse  a  sentence#  then  hand  It  off  to  an 
Interpreter.  As  It  finds  each  piece  of  the  syntactic 
structure#  It  checks  Its  semantic  Intepretatlon#  first  to  see  If 
It  Is  plausible#  then  (If  possible)  to  see  If  It  Is  In  accord 
with  the  system's  knowledge  of  the  world#  both  specific  and 
general.  This  has  been  done  In  a  limited  way  by  other  systems# 
but  In  our  program  It  Is  ^n  Integral  part  of  understanding  at 
every  level. 

A.  Syntax 

First  we  need  a  system  for  the  syntactic  analysis  of  Input 
sentences#  and  any  phrases  and  other  non-sentences  we  might  want 
In  our  dialogs.  There  have  been  many  different  parsing  systems 
developed  by  different  language  projects#  each  based  on  a 
particular  theory  of  grammar.  The  type  af  grammar  chosen  plays 
a  major  role  In  the  type  of  semantic  analysis  which  can  be 
carried  out.  A  language  named  PROGRAMMAR  was  designed 
specifically  to  fit  the  type  of  analysis  used  In  this  system. 

It  differs  from  other  parsers  In  that  the  grammar  Itself  Is 
written  In  the  form  of  a  collection  of  programs#  and  the  parsing 


system  Is  In  effect  an  Interpreter  for  the  language  used  In 
writing  those  programs. 

Having  chosen  the  "type  of  grammar"#  we  need  to  formalize  a 
grammar  for  parsing  sentences  In  a  particular  language.  Our 
system  Includes  a  comprehensive  grammar  of  English  following  the 
lines  of  systemic  F.rarmar  (see  Section  2.3).  This  type  of 
grammar  Is  well  suited  to  a  complete  language-understanding 
system  since  It  views  language  as  a  system  for  conveying  meaning 
and  Is  highly  oriented  toward  semantic  analysis.  It  Is  Intended 
to  cover  a  wide  range  of  syntactic  constructions;  one  basic 
criterion  for  the  completeness  of  the  grammar  Is  that  a  person 
with  no  knowledge  of  the  system  or  Its  grammar  should  be  able  to 
type  any  reasonable  sentence  within  the  limitations  of  the 
vocabulary  and  expect  It  to  be  understood. 

B.  Inference 

At  the  other  end  of  the  linguistic  process  we  need  a 
deductive  system  which  can  be  used  not  only  for  such  things  as 
resolving  ambiguities  and  answering  questions#  but  also  to  allow 
the  parser  to  use  deduction  In  trying  to  parse  a  sentence.  The 
system  uses  PLANNER#  a  deductive  system  designed  by  Carl  Hewitt 
(see  <Hewltt  1969#  1970>)  which  Is  based  on  a  philosophy  very 
similar  to  the  general  mood  of  this  project.  Deduction  In 
PLANNER  Is  not  carried  out  In  the  traditional  "logistic 
framework"  In  which  a  general  procedure  acts  on  a  set  of  axioms 
or  theorems  expressed  In  a  formal  system  of  logic.  Instead# 
each  theorem  Is  In  the  form  of  a  program#  and  the  deductive 
process  can  be  directed  to  any  desired  extent  by  "Intelligent 
theorems."  PLANNER  Is  actually  a  language  for  the  writing  of 
those  theorems. 

This  deductive  system  must  be  given  a  model  of  the  world# 
with  the  concepts  and  knowledge  needed  to  make  Its  deductions. 
Useful  language-understanding  can  occtr  only  when  a  program  (or 
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person)  has  an  adequate  understanding  of  the  subject  he  Is 
talking  about.  We  will  not  attempt  to  understand  arblttary 
sentences  talking  about  unknown  subjects^  but  Instead  will  give 
the  system  detailed  knowledge  about  a  particular  subject  —  In 
this  case,  the  simple  robot  world  of  children's  toy  blocks  and 
some  other  common  objects.  The  deductive  system  has  a  double 
task  of  solving  goal -problems  co  plan  and  carry  out  actions  for 
a  robot  within  this  worlds  and  then  talking  about  what  It  Is 
doing  and  what  the  scene  looks  like.  We  want  the  robot  to 
discuss  Its  plans  and  actions  as  well  as  carry  them  out.  We  can 
ask  questions  not  only  about  physical  happenings,  but  also  about 
the  robot's  goals.  V^e  can  ask  "Why  did  you  clear  off  that 
block?"  or  "How  did  you  do  It?".  This  means  that  the  model 
Includes  not  only  the  properties  of  blocks,  hands,  and  tables, 
but  a  model  of  the  robot  mind  as  well.  We  have  written  a 
collection  of  PLANNER  theorems  and  data  called  BLOCKS, 
describing  the  world  of  toy  blocks  seen  and  manipulated  by  the 
robot,  and  the  knowledge  It  needs  to  work  with  that  world,  (see 
Section  3.4).  Figure  1  shows  a  typical  scene. 

C.  Semantics 

To  connect  the  syntactic  form  of  the  sentence  to  Its 
meaning,  we  need  a  semantic  system  which  provides  primitive 
operations  relevant  to  semantic  analysis.  This  Includes  a 
language  In  which  we  can  easily  express  the  meanings  of  words 
and  syntactic  constructions.  The  system  Includes  mechanisms 
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Figure  1 
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for  setting  up  simple  types  of  semantic  networks  and  using 
deductions  from  them  as  a  first  phase  of  semantic  analysis.  For 
example,  the  network  could  Include  the  Information  that  a 
"block*'  Is  a  physical  object,  while  a  "bloc"  Is  a  political 
object,  and  the  definition  of  the  word  "support"  could  use  this 
Information  In  choosltig  the  correct  meanings  for  the  sentences; 

The  red  block  supports  the  pyramid.  and 

The  red  bloc  supports  Egypt. 

More  Important,  the  meaning  of  a  word  or  construction  Is 
also  defined  In  the  form  of  a  program  to  be  Interpreted  In  a 
semantic  language.  It  Is  this  procedural  aspect  of  semantics 
which  Is  missing  In  most  other  theories,  which  limit  themselves 
to  a  particular  type  of  network  or  relational  structure.  The 
meaning  selected  for  a  word  can  depend  on  any  aspect  of  the 
sentence,  tha  discourse,  or  the  world.  In  deciding  on  the 
meaning  of  "one"  In  "Pick  up  the  green  one",  we  need  a  program 
which  can  examine  past  sentences.  This  program  Is  Included  as 
part  of  the  definition  of  the  word  "one". 

The  semantic  system  Includes  a  powerful  heuristic  program 
for  resolving  ambiguities  and  determining  the  meaning  of 
references  In  discourse.  In  almost  every  sentence,  reference  Is 
made  either  explicitly  (as  with  pronouns)  or  Implicitly  (as  with  » 

the  word  "too")  to  objects  and  concepts  not  explicitly  mentioned 
In  that  sentence.  To  Interpret  these,  the  program  must  have  at 
Its  disposal  not  only  a  detailed  grammatical  analysis  (to  check 
for  such  things  as  parallel  constructions),  but  also  a  powerful 
deductive  capacity  (to  see  which  reference  assignments  are  * 

logically  plausible),  and  a  thorough  knowledge  of  the  subject  It 
Is  discussing  (to  see  which  Interpretations  are  reasonable  In 
the  current  situation). 
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In  order  to  deal  with  language  In  a  human  way#  we  must  take 
Into  account  all  sorts  of  discourse  knowledge.  In  addition  to 
remembering  the  Immediately  previous  sentences  for  such  things 
as  pronoun  references#  the  system  must  remember  what  things  have 
been  mentioned  throughout  the  discussion#  so  that  a  reference  to 
"the  pyramid"  will  mean  "the  pyramid  we  mentioned  earlier"  even 
If  there  are  several  pyramids  In  the  scene. 

In  addition  the  system  must  have  some  knowledge  of  the  way 
a  person  will  communicate  with  It.  If  we  ask  "Is  there  a  block 
on  a  green  table?"  "What  color  Is  It?"#  the  word  "It"  refers  to 
the  block.  But  If  we  had  asked  "Is  there  a  green  block  on  a 
table?"  "What  color  Is  It?"#  "It"  must  refer  to  the  table  since 
we  would  not  ask  a  question  which  we  had  answered  ourselves  In 
the  previous  sentence. 

Our  semantic  system  works  with  a  base  of  knowledge  about 
simple  semantic  features  In  the  subject  domain#  and  with  a 
collection  of  definitions  for  Individual  words.  These 
definitions  are  written  In  a  "semantics  language"  which  allows 
simple  words  to  be  defined  In  a  straightforward  way#  while 
allowing  more  complex  words  to  call  on  arbitrary  amounts  of 
computation  to  Integrate  their  meaning  Into  the  sentence. 

Finally  we  need  a  generative  language  capacity  to  produce 
answers  to  questions  and  to  ask  questions  when  necessary  to 
resolve  ambiguities.  Grammatically  this  Is  much  less  demanding 
than  the  Interpretive  capacity#  since  humans  can  be  expected  to 


understand  a  wide  range  ctf  responses#  and  It  Is  possible  to 
express  almost  anything  In  a  syntactically  simple  way.  However# 
it  takes  a  sophisticated  semantic  and  deductive  capability  to 
phrase  things  In  a  way  which  Is  meaningful  and  natural  In 
discourse#  since  the  form  of  a  response  depends  on  both  the 
context  and  on  what  the  speaker  assumes  that  the  hearer  knows 
and  wants  to  know. 
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1,2  Implementation  of  the  System 
The  language  understanding  program  Is  written  In  LISP  to 
run  under  the  PDP-10  Incompatible  Time-sharing  System  at  the 
Artificial  Intelligence  Laboratory  at  MIT.  When  operating  with 
a  200  word  vocabulary  and  a  fairly  complex  scene/  It  occupies 
approximately  80K  of  core.  This  Includes  the  LISP  Interpreter# 
all  of  the  programs#  dictionary  entries#  and  data#  and  enough 
free  storage  to  remember  a  sequence  of  actions  and  to  handle 
complex  sentences  and  deductions.  See  Figure  3  for  a  more 
detailed  descripton  of  memory  usage. 

The  program  Is  organized  as  Indicated  In  Figure  2.  (Arrows 
Indicate  that  one  part  of  the  program  calls  another  directly): 


1.  MONITOR  Is  a  small  LISP  program  which  calls  the  basic 
parts  of  the  system.  Since  the  system  Is  organized  vertlcaly# 
most  of  the  communication  between  components  Is  done  directly# 
and  the  monitor  Is  called  only  at  the  beginning  and  end  of  the 
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Storage  Allocation  for  Language  Understanding  Program 
In  Thousands  of  PDP-10  words 

Note;  Approximately  12  thousand  additional  words  of  free 
storage  are  necessary  for  a  dialog  like  the  one  described  In 
Section  1.3.  As  the  length  of  dialog  or  complexity  of  the 
actions  Is  Increased,  more  free  storage  Is  needed. 


Figure  2  --  Memory  Requirements 
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understanding  process. 

2.  INPUT  Is  a  LISP  program  vhich  accepts  typed  Input  In 
normal  English  orthography  and  punctuation,  looks  up  words  In 
the  dictionary,  performs  morphemic  analysis  (e.g.  realizing  that 
"running"  Is  the  "Ing"  form  of  the  word  "run",  and  modifying  the 
dictionary  definition  accordingly),  and  returns  a  string  of 
words,  together  with  their  definitions.  This  Is  the  Input  with 
which  the  grammar  works. 

3.  The  GRAMMAR  Is  the  main  coordinator  of  the  language 
understanding  process.  It  consists  of  a  few  large  programs 
written  In  PROGRAMMAR  to  handle  the  basic  units  of  the  English 
language  (such  as  clauses,  mun  groups,  preposl tonal  groups, 
etc.}.  There  are  two  PROGRAMMAR  compilers,  one  Wi.lch  compiles 
Into  LISP,  which  Is  run  Interpretlvely  for  easy  debugging,  and 
another  which  makes  use  of  the  LISP  compiler  to  produce  LAP 
assembly  code  for  efficiency. 

4.  SEMANTICS  Is  a  collection  of  LISP  programs  which  work  In 
coordination  with  the  GRAMMAR  to  Interpret  sentences.  In 
general  there  are  a  few  semantics  programs  corresponding  to  each 
basic  unit  In  the  grammar,  each  performing  one  phase  of  the 
analysis  for  that  unit.  These  semantics  programs  call  PLANNER 
to  make  use  of  deduction  In  Interpreting  sentences. 

5.  ANSWER  Is  another  collection  of  LISP  programs  which 
control  the  responses  of  the  system,  and  take  care  of 
remembering  the  discourse  for  future  reference.  It  contains  a 
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number  of  heuristic  programs  for  producing  answers  which  take 
the  discourse  Into  account,  both  In  deciding  on  an  answer  and  In 
figuring  out  how  to  express  It  In  fluent  English. 

6.  PROGRAMMAR  Is  a  parsing  system  which  Interprets  grammars 
written  In  the  form  of  programs.  It  has  mechanisms  for  building 
a  parsing  tree,  and  a  number  of  special  functions  for  exploring 
and  manipulating  this  tree  In  the  GRAMMAR  programs.  It  Is 
written  In  LISP. 

7.  The  DICTIONARY  actually  consists  of  two  parts.  The  first 
Is  a  set  of  syntactic  features  associated  with  each  word,  used 
by  the  GRAMMAR.  The  second  Is  a  semantic  definition  for  each 
word,  written  In  a  language  which  Is  Interpreted  by  the 
SEMANTICS  programs.  The  form  of  a  word's  definition  depends  on 
Its  word  class  (e.g.  the  definition  of  "two"  Is  "2").  There  are 
special  facilities  for  Irregular  forms  (like  "geese"  or 
"slept"),  and  only  the  definitions  of  root  words  are  kept,  since 
INPUT  can  analyze  a  variety  of  endings.  The  definitions  are 
actually  kept  on  the  LISP  property  list  of  the  word,  and 
dictionary  lookup  Is  handled  automatically  by  LISP. 

8.  The  system  has  a  network  of  SEMANTIC  FEATURES,  kept  on 
property  lists  and  used  for  an  Initial  phase  of  semantic 
analysis.  The  features  subdivide  the  world  of  objects  and 
actions  Into  simple  categories,  and  the  semantic  Interpreter 
uses  these  categories  to  make  some  of  Its  choices  between 
alternative  definitions  for  a  word  (for  example  "In"  would  mean 
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"contained  In"  when  applied  to  a  hollow  object  like  a  box/  but 
would  mean  "part  of"  when  applied  to  a  conglomerate  like  a 
stack#  as  In  "the  red  block  Jji  the  tall  stack"). 

9.  BLOCKS  Is  a  collection  of  PLANNER  theorems  which  contain 
the  system's  knowledge  about  the  properties  of  the  physical 
world#  and  the  methods  of  achieving  goals  and  making  deductions 
about  It.  It  Is  a  very  simple  model#  designed  less  for  a 
realistic  simulation  of  a  robot#  and  more  to  give  the  system  a 
world  to  talk  about  In  which  one  can  say  many  different  types  of 
th I ngs . 

10.  MOVER  Is  a  set  of  display  routines  written  In  LAP  which 
simulate  the  robot's  world  on  the  DEC  340  graphic  display 
attatched  to  the  PDP-10  computer.  It  displays  the  scene 
Indicated  by  the  DATA  currently  In  memory#  and  any  action  called 
for  Is  displayed  In  real  time#  so  the  human  can  get  visual 
feedback  from  his  commands  to  the  robot.  The  Input  sentences 
and  responses  are  also  displayed.  Figure  1  displays  a  typical 
scene. 

11.  PLANNER  Is  the  deductive  system  used  by  the  program#  and 
Is  used  at  all  stages  of  the  analysis#  both  to  direct  the 
parsing  process  and  to  deduce  facts  about  the  BLOCKS  world. 
Currently  the  system  uses  Micro-Planner#  an  abbreviated  version 
of  Carl  Hewitt's  original  language#  Implemented  by  Terry 
WInograd#  Gerald  Sussman#  and  Eugene  Charnlak  <Sussman>.  It  Is 
written  In  LISP. 
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12.  The  DATA  about  the  current  scene  Is  li  the  form  of 
PLANNER  assertions  describing  the  objects/  their  sizes#  shapes# 
colors#  and  locations.  The  system  assumes  It  has  some  sort  of 
robot  "eye"/  so  all  of  this  Information  Is  Initially  known  and 
does  not  have  to  be  entered  In  English. 

When  running  compiled#  the  system  Is  fast  enough  to  carry 
on  a  real-time  discourse.  Each  sentence  takes  from  5  to  20 
seconds  to  analyze  and  respond  to#  and  the  display  Is  designed 
to  move  at  the  speed  of  a  real  arm. 

The  program  listings  are  all  available  In  hard  copy  and 
DECtape#  but  before  Implementing  them  on  another  system#  It 
would  be  wise  to  read  about  the  special  features  of  MACLISP 
<Whlte>  which  may  be  In  conflict  with  other  LISP 
I mpl ementat I ons . 


9 
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1.3  Sample  Dialog 

This  section  contains  a  sample  of  a  dialog  with  the 
language  understanding  system.  It  carries  out  actions  and 
answers  questions  about  a  simple  scene  containing  a  table,  hand, 
and  box,  and  several  blocks  and  pyramids.  The  lines  In  lower 
case  were  typed  In  by  a  person,  and  the  lines  In  upper  case  are 
the  computer's  response.  Indented  lines  are  comments  describing 
some  of  the  features  being  exhibited. 

The  dialog  was  carried  out  In  real  time  with  the  response 
of  the  "robot"  displayed  on  a  CRT  screen.  Figures  are  Included 
to  show  the  state  of  the  scene  at  various  points. 


1.  pick  up  a  big  red  block. 


The  system  answers  "OK"  when  It  carries  out  a 
command.  In  order  to  pick  up  the  red  block. 

It  had  to  clear  It  off  by  finding  a  space  for 
the  green  one  and  moving  the  green  one  away. 
Figure  4  shows  the  scene  when  the  command  was 
given.  Figure  5  shows  the  green  block  being 
put  out  of  the  way,  and  Figure  6  shows  the  end 
resul t. 

2.  grasp  the  pyramid. 

I  DON'T  UNDERSTAND  WHICH  PYRAMID  YOU  MEAN. 

There  are  three  pyramids  In  the  scene,  so  the 
program  does  nothing. 

3.  find  a  block  which  Is  taller  than  the  one  you  are  holding 
and  put  It  Into  the  box. 

The  system  has  to  figure  out  what  Is  meant  by 
"one"  and  "It",  as  well  as  using. deduction  to 
deride  which  block  It  Is  holding. 


f 
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Figure  4 
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Figure  5 


Figure  6 


> 
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BY  ‘'IT",  I  ASSUME  YOU  MEAN  THE  BLOCK  WHICH  IS  TALLER  THAN  THE 
ONE  I  AM  HOLDING. 


OK. 


Printed  out  to  keep  us  Informed. 

The  hand  has  to  set  down  the  red  block  to  pick 
up  the  taller  blue  one.  Figure  7  shows  the 
result. 


4.  what  does  the  box  contain? 


THE  BLUE  PYRAMID  AND  THE  BLUE  BLOCK. 


5.  what  is  the  pyramid  supported  by? 

Earlier  the  system  responded  to  "the  pyramid" 
by  saying  "I  don't  understand  which  pyramid 
you  mean."  now#  It  knows  that  we  must  mean 
the  one  which  was  Just  mentioned. 


THE  BOX. 

6.  how  many  blocks  are  not  In  the  box? 

Negatives  such  as  "not"  and  "no"  are  handled 
whether  they  appear  In  clauses  or  noun  groups 
(such  as  "no  boxes"). 

FOUR  OF  THEM. 

7.  Is  at  least  one  of  them  narrower  than  the  one  which  I  told 
you  to  pick  up? 

The  phrase  "at  least  one  of  them"  must  be 
analyzed  to  refer  back  to  the  previous 
question,  and  "the  one  which..."  must  also 
refer  to  blocks.  Noun  groups  can  have 
complicated  numbers  such  as  "at  least 
one", "more  than  three",  "exactly  seven",  etc. 
The  memory  of  past  events  Is  used  to  deduce 
which  block  Is  "the  one  which  1  told  you  to 
pick  up".  Notice  the  complicated  grammatical 
structure  In  which  "the  one"  Is  actually  the 
object  of  a  clause  embedded  two  levels  deeper. 

YES,  THE  RED  CUBE. 

8.  Is  It  supported? 
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Figure  7 
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Passives  can  be  used  without  a  "by  ..." 
phrase. 

YES,  BY  THE  TABLE. 

9.  can  the  table  pick  up  bionics? 


The  semantic  rules  make  this  a  nonsensical 
question  which  therefore  must  be  false. 

10.  can  a  pyramid  be  supported  by  a  block? 


YES. 

The  deductive  system  finds  an  actual  example, 
so  It  knows  this  Is  possible. 

11.  can  a  pyramid  support  a  pyramid? 

I  DON'T  KNOW. 

The  question  makes  sense,  but  there  Is  no 
example  In  the  scene,  so  there  Is  no  easy  way 
to  tell  without  being  able  to  examine  the 
robot's  programs. 

12.  stack  up  two  pyramids. 

I  CAN'T. 

The  robot  tries  and  falls. 

13.  the  blue  pyramid  Is  nice. 

I  UNDERSTAND. 

This  data  Is  added  to  the  data  base. 

14.  I  like  blocks  which  are  not  red,  but  1  don't  like  anything 
which  supports  a  pyramid. 

I  UNDERSTAND. 


Two  new  theorems  are  created  for  proving 
things  about  "liking". 

15.  do  I  like  the  box? 


NO. 

The  box  supports  whatever  It  contains. 
Including  a  pyramid. 


Section  1.3  -  Page  42 


16.  do  I  like  anything  In  the  box? 

YES,  TWO  THINGS:  IHE  BLUE  BLOCK  AND  THE  BLUE  PYRAMID. 

Presumably  we  like  anything  which  Is  nice,  and 
the  blue  block  Is  certainly  not  red. 

17.  will  you  please  stack  up  both  of  the  red  blocks  and  either 
a  green  cube  or  a  pyraml--’? 


OK. 


Logical  connectives  such  as  "and"  "or" 

"either"  etc.  are  handled  In  both  the  grammar 
and  semantics.  A  corimand  Is  executed  even  If 
It  looks  like  a  question.  The  robot  carries 
out  the  comma<(d  by  the  series  of  steps  shown 
In  figures  8  through  11.  It  puts  a  green  cube 
on  the  large  red  block  (note  that  It  chooses 
the  green  cube  with  nothing  on  It),  then 
removes  the  small  pyramid  from  the  little  red 
cube,  so  that  It  can  use  that  cube  to  complete 
the  stack  as  specified. 


18.  which  cube  Is  sitting  on  the  table? 


THE  LARGE  GREEN  ONE  WHICH  SUPPORTS  THE  RED  PYRAMID. 

Ordinarily  the  system  would  be  satisfied  with 
calling  this  "a  big  green  cube",  but  since  we 
asked  "which  cube"  It  has  to  be  more  specific 
In  naming  It.  The  use  of  substitute  nouns 
(such  as  "one")  and  pronouns  In  answers  makes 
the  discourse  more  natural. 


19.  Is  there  a  large  block  behind  a  pyramid? 

YES,  THREE  OF  THEM:  A  LARGE  RED  ONE,  A  LARGE  GREEN  CUBE  AND 
THE  BLUE  ONE. 


20.  put  a  small  one  onto  the  green  cube  which  supports  a 
pyramid. 


"A  small  one"  must  contrast  with  the  earlier 
phrase  "a  large  block",  so  this  means  "a  small 
block" . 


OK. 


>■ 


21.  put  the  llttlest  pyramid  on  top  Of  It 
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Figure  8 


Figure  9 
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Figure  10 
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OK. 


Words  1  I ke 
d I ct I onary 
forms  1  I ke 
Figure  12, 


"llttlest"  are  not  In  the 

but  are  Interpreted  from  the  root 

"little".  The  scene  now  looks  like 


22.  does  the  shortest  thing  the  tallest  pyramid's  support 
supports  support  anything  green? 


The  grammar  can  handle  words  like  "support" 
which  are  both  verbs  and  nouns  without  getting 
confused/  and  can  understand  possess  I ves .  To 
compute  the  answer,  see  Figure  13  for  the 
scene. 


YES/  THE  GREEN  PYRAMID. 

23.  what  color  Is  It? 

BY  "IT"/  I  ASSUME  YOU  MEAN  THE  SHORTEST  THING  THE  TALLEST 
PYRAMID'S  SUPPORT  SUPPORTS. 

If  we  had  asked  "what  size  Is  It?",  the  system 
would  have  assumed  that  "It"  meant  the  green 
pyramid/  but  nobody  would  ask  the  color  of 
something  which  was  just  called  "the  green 
pyramid''/  so  It  had  to  look  farther  for  the 
reference  of  "It". 


RED 


24,  how  many  things  are  on  top  of  green  cubes? 

I'M  NOT  SURE  WHAT  YOU  MEAN  BY  "ON  TOP  OF"  IN  THE  PHRASE  "ON 
TOP  OF  GREEN  CUBES  ". 

DO  YOU  MEAN: 

1  -  DIRECTLY  ON  THE  SURFACE 

2  -  ANWHERE  ON  TOP  OF  ? 

When  there  are  several  Interpretations  and 
none  Is  clearly  better,  the  syrerm  has  to  ask 
the  user  for  more  Information.  The 
paraphrases  were  Included  with  the  dictionary 
definition  of  "on",  and  the  request  for  more 
Information  occurs  automatically  when  the 
heuristics  for  resolving  ambiguities  aren't 
sufficient.  A  request  like  this  can  also 
occur  when  the  meaning  of  a  pronoun  such  as 
"It"  or  "they"  Is  unclear. 
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THREE  OF  THEM. 

25.  had  you  touched  any  pyramid  before  you  put  the  green  one  on 
the  little  cube? 

YES^  THE  GREEN  ONE. 

The  system  deduces  from  Its  memory  what  event 
!s  being  referred  to,  and  can  relate  other 
events  to  It  In  time.  The  analysis  Includes 
complex  tenses  of  verb  groups,  and  there  are 
special  facilities  for  keeping  track  of  the 
various  forms  of  Irregular  verbs  like  "have”. 

26.  when  did  you  pick  It  up? 

WHILE  I  WAS  STACKING  UP  THE  RED  CUBE,  A  LARGE  RED  BLOCK  AND  A 
LARGE  GREEN  CUBE. 

A  time  Is  described  by  naming  the  main  goal 
which  was  belr.g  worked  on. 

27.  why? 

The  system  remembers  Immediately  previous 
sentences  to  understand  questions  like  this 
and  the  following  ones. 

TO  GET  RID  OF  IT. 

By  keeping  track  of  selected  parts  of  the 
original  subgoal  tree,  the  system  has  some 
understanding  of  Its  own  motives. 

2S.  why  did  you  do  that? 

TO  CLEAN  OFF  THE  RED  CUBE. 

"That"  refers  to  the  action  mentioned  In  the 
answer. 

29.  why  did  you  clear  off  that  cube? 

Here,  "that"  means  the  one  just  referred  to. 
TO  PUT  IT  ON  A  LARGE  GREEN  CUBE. 
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30.  why  did  you  do  that? 

TO  STACK  UP  THE  RED  CUBE  ^  A  LARGE  RED  BLOCK  AND  A  LARGE  GREEN 
CUBE. 

31.  why  did  you  do  that? 

BECAUSE  YOU  ASKED  ME  TO. 

It  re'iiembers  what  It  did,  not  how  the  request 
was  worded. 

32.  how  d  d  you  do  It? 

"It"  refers  to  the  action  mentioned  In  the 
last  question,  rather  than  the  answer. 

BY  PUTTING  A  LARGE  RED  BLOCK  ON  THE  TABLE  ;  THEN  LETTING  GO 
OF  IT  ;  THiN  PUTTING  A  LARGE  GREEN  CUBE  ON  IT  ;  THEN  LETTING  GO 
OF  THAT  CUBE  ;  THEN  PUTTING  THE  RED  CUBE  ON  THAT  CUBE  ;  THEN 
LETTING  GO  OF  THAT  CUBE. 

This  Is  a  description  of  the  top-level 
subgoals  used  to  achieve  the  goal. 

33.  how  many  objects  did  you  touch  while  you  were  doing  It? 

Time  references  can  r::fer  to  a  time  span, 
using  words  like  "while". 


FOUR  OF  THEM. 

34.  what  did  the  red  cube  support  before  you  started  to  clean 
It  off? 

THE  GREEN  PYRAMID. 

By  remembering  what  has  been  moved.  It  Is 
possible  to  deduce  things  about  earlier  states 
of  the  scene.  Vie  can  use  complex  nestings  of 
clauses  like  "before  you  started  to  clean  It 
off". 


35.  there  were  five  blocks  to  the  left  of  the  box  then. 

«  "Then"  must  refer  to  the  time  mentioned  In  the 

last  sentence. 

NO,  ONLY  FOUR  OF  THEM:  THE  RED  CUBE  ,  TWO  LARGE  GREEN  CUBES 
AND  A  LARGE  RED  BLOCK. 
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Earlier  In  the  dialog#  Information  about 
"liking"  was  accepted  at  face  value.  But  the 
system  knows  that  It  has  complete  Information 
about  about  the  basic  properties  of  objects# 
Including  location.  It  checks  any  statements 
about  location  or  other  such  properties  to  see 
whether  It  agrees  or  disagrees. 

36.  put  the  blue  pyramid  on  the  block  In  the  box. 

This  Is  syntactically  ambiguous#  but  Is 
understood  without  ambiguity  since  the  parsing 
corresponding  to  "put  the  blue  pyramid  which 
is  on  the  block  In  the  box."  does  not  make 
sense  In  the  current  scene#  while  "put  the 
blue  pyramid  on  the  block  which  Is  In  the  box" 
does.  We  can  see  the  sltutatlon  In  Figure  14# 
and  the  result  In  Figure  15. 

OK. 

37.  Is  there  anything  which  Is  bigger  than  every  pyramid  but  Is 

not  as  wide  as  the  thing  that  supports  It? 

The  system  understands  quantifiers  such  as 
"every"#  "any"#  "some"#  etc.#  and  structures 
such  as  "bigger  than"#  "as  wide  as"#  etc. 

Here  "It"  refers  to  a  noun  group  within  the 
same  sentence  but  two  levels  up  In  the  parsing 
tree . 

YES#  THE  BLUE  BLOCK. 

33.  does  a  steeple 

SORRY#  I  DON’T  KNOW  THE  WORD  "STEEPLE".  PLEASE  TYPE  TWO  SPACES. 

The  system  Interrupts  l.Ti<nedl  atel  y  when  1 1  sees 
a  word  It  can’t  figure  out.  The  two  spaces 
are  to  clear  the  Input  buffer  of  anything 
typed  after  the  message  was  sent  but  before 
the  typist  saw  It. 

40.  a  "steeple"  1 1'  a  stack  which  contains  twe  green  cubes  and  a 

pyramid. 

The  quote  marks  Indlcat'j  that  we  are  defining 
a  new  word. 
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0H  I/ze  B7cc^  i'^t  -f/ie  i>ojc. 


.-igure  J4 


•o 
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Utc  blue  J>^TL*mcC  cm-  ib^zc  bli^clL  ifte  bqjc. 

ot  T 


F I gure  15 
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I  UNDERSTAND. 

Ul.  a.'e  there  any  steeple:,  now? 


NO. 


The  new  definition  Is  used  as  a  description  to 
recognize  steeples. 


42.  build  one. 

OK. 

The  description  given  In  the  definition  Is 
enough  to  deduce  a  plan  for  building  new 
objects  as  well  as  recognizing  them.  The  word 
"one"  means  "a  steeple'*,  and  the  plan  Is 
carried  out.  First,  one  of  the  large  green 
cubes  Is  cleared  off  by  removing  the  red  cube 
and  two  pyramids.  Then  the  two  green  cubes 
and  the  tall  red  pyramid  are  stacked  up  to  fit 
the  description.  Figures  16  thrcjgh  19  show 
the  Initial  state,  some  Intermediate  steps, 
and  the  result. 

43.  call  the  biggest  block  "superblock". 


OK. 

We  can  also  give  objects  names. 

44.  have  you  picked  up  superblock  since  we  began? 
YES. 


45.  why  did  you  drop  It? 


BY  "IT",  I  ASSUME  YOU  MEAN  THE  SUPERBLOCK. 
I  CAN'T  EXPLAIN  A  NON-EXISTENT  EVENT. 


There  Is  no  record  In  memory  of  any  block 
being  dropped. 

46.  Is  there  anything  to  the  right  of  the  red  pyramid? 

YES,  FOUR  THINGS;  THE  BOX  ,  THE  RED  CUBE,  THE  BLUE  PYRAMID 
AND  THE  BLOCK  NAMED  SUPERBLOCK 


Once  an  object  has  a  name,  the  name  Is  used  to 
help  Identify  It  In  responses. 
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Figure  16 
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Figure  17 
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Figure  19 
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4  7.  thank  you . 
YOU’RE  WELCOMfl 


The  special  I d I om-def I n I ng  capabilities  allow 
us  to  Include  such  pleasantries  without 
modifying  the  grammar. 
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Chapter  2.  Syntax 

2.1  Basic  Approach  to  Syntax 
IjXJL  2Ild  leaning 

The  decision  to  consider  syntax  as  a  proper  study  devoid  of 
semantics  Is  a  basic  tenet  of  most  current  linguistic  theories. 
Language  Is  viewed  as  a  way  of  organizing  strings  of  abstract 
symbols/  and  competence  Is  explained  In  terms  of  symbol- 
manipulating  rules.  At  one  level  this  has  been  remarkably 
successful.  Rules  have  been  formulated  which  describe  In  great 
detail  how  most  sentences  are  put  together.  The  problem  comes 
In  trying  to  relate  those  symbols  and  structures  to  the  function 
of  language  as  a  vehicle  for  conveying  meaning.  The  same 
approach  whlc‘.  has  worked  so  v-^ell  in  accounting  for  the 
machinations  of  syntax  has  been  unable  to  provide  any  but  the 
most  rudimentary  and  unsatisfactory  accounts  of  semantics. 

The  problem  Is  not  that  current  theories  are  finding  wrong 
answers  to  the  questions  they  ask;  It  Is  that  they  are  asking 
the  wrong  questions.  Vihat  Is  needed  Is  an  approach  which  can 
deal  meaningfully  with  the  question  "How  Is  language  organized 
to  convey  meaning?"  rather  than  "How  are  syntactic  structures 
organized  when  viewed  In  isolation?". 

How  does  a  senten>';e  convey  meaning  beyond  the  meanings  of 
Indlvld!*al  words?  Here  Is  the  place  for  syntax.  The  structure 
of  a  sentence  can  be  viewed  as  the  result  of  a  series  of 
grammatical  choices  made  In  generating  It.  The  speaker  encodes 
meaning  by  choosing  to  build  the  sentence  with  certain 
"features"/  and  the  problem  of  the  hearer  Is  to  recognize  the 
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presence  of  those  features  and  Interpret  their  meaning. 

We  want  to  analyze  the  possible  choices  o^  features  and 
functions  which  grammatical  structures  can  have.  For  example, 
we  might  note  that  all  sentences  must  be  either  IMPERATIVE, 
DECLARATIVE,  or  a  QUESTION,  and  that  In  the  last  case  they  must 
choose  as  well  between  being  a  YES~NO  question  or  a  WH-  question 
containing  a  word  such  as  "why"  or  "which".  We  can  study  the 
way  In  which  these  features  of  sentences  are  organized  --  which 
ones  form  mutually  exclusive  sets  (called  "systems"),  and  which 
sets  depend  on  the  presence  of  other  features  (like  the  set 
containing  YES-NO  and  WH-  depends  on  the  presence  of  QUESTION). 
This  can  be  done  not  only  for  full  sentences,  but  for  smaller 
syntactic  units  such  as  noun  groups  and  prepositional  groups,  or 
even  for  Individual  words. 

In  addition  we  can  study  the  different  functions  a 
syntactic  "unit"  can  have  as  a  part  o:  a  larger  unit.  In 
"Nobody  wants  to  be  alone.",  the  clause  "to  be  alone"  has  the 
function  of  OBJECT  In  the  sentence,  while  the  noun  group 
"nobody"  Is  the  SUB>icCT.  We  can  note  that  a  transitive  clause 
must  hc»'e  units  to  fill  the  functions  of  SUBJECT  and  OBJECT,  or 
that  a  WH-  question  has  to  have  some  constituent  v^hlch  has  the 
role  of  "question  element"  (like  ''why"  in  "Why  did  he  go?"  or 
"which  dog"  In  "Which  dog  stole  the  show?"). 

In  most  current  theories,  these  features  and  functions  are 
Implicit  In  the  syntactic  rules.  There  Is  no  explicit  mention 
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of  them,  but  the  rules  are  designed  In  such  a  way  that  every 
sentence  will  In  fact  be  one  of  the  three  types  listed  above, 
and  every  VJH-  question  will  In  fact  have  a  question  element. 

The  difficulty  Is  that  there  Is  no  attempt  In  the  grammar  to 
distinguish  significant  features  such  as  these  from  the  Infinite 
number  of  other  features  we  could  note  about  a  sentence,  and 
which  are  also  Implied  by  the  rules. 

If  we  look  at  the  "deep  structure"  of  a  sentence,  again  the 
features  and  functions  are  Implicit.  The  fact  that  It  Is  a  YES- 
NO  question  Is  Indicated  by  a  question  marker  hanging  from  a 
particular  place  In  the  tree,  and  the  fact  that  a  component  is 
the  object  or  subject  ‘s  determined  from  Its  exact  relation  to 
the  branches  around  It.  The  problem  Isn't  that  there  Is  no  way 
to  find  these  features  In  a  parsing,  but  that  most  theories 
don't  bother  to  ask  "Which  features  of  a  syntactic  structure  are 
Important  to  conveying  meaning,  and  which  are  just  a  by-product 
of  the  symbol  manipulations  needed  to  produce  the  right  word 
order." 

What  we  would  like  Is  a  theory  In  which  these  choices  of 
features  are  primary.  Professor  M.A.K.  Halllday  at  the 
Unlv'irslty  of  London  has  been  working  on  such  a  theory,  called 
Systemic  Grammar  (see  references  <Halllday  1961,  1966a,  1966b, 
1967>  <Huddleston>,  <Hudson>).  His  theory  recognizes  that 
meaning  Is  of  prime  Importance  to  the  way  language  Is 
structured.  Instead  of  having  a  "deep  structure"  which  looks 
like  a  kind  of  syntactic  structure  tree,  he  deals  v/lth  "system 
networks"  which  describe  the  way  different  features  Interact  and 
depend  on  each  other.  The  primary  emphasis  Is  on  analyzing  the 
limited  and  highly  structured  sets  of  choices  which  are  made  In 
producing  a  sentence  or  constituent.  The  exact  way  In  which 
these  choices  are  "realized"  In  the  final  form  Is  a  necessary 
but  secondary  part  of  the  theory. 
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The  reallration  rules  carry  out  the  work  which  wauld  be 
done  by  t  rans  f  orina  1 1  ons  In  transformational  grammar  (TG).  In 
TGj  the  sentences  "Sally  saw  the  squirrel.",  "The  squirrel  was 
seen  by  Sally.",  and  "Did  Sally  see  the  squirrel?"  would  be 
derived  from  almost  Identical  deep  structures,  and  the 
difference  In  final  form  Is  produced  by  t ransformat I ons .  In 
systemic  grammar,  these  would  be  analyzed  as  having  most  of 
their  features  In  common,  but  differing  In  one  particular 
choice,  such  as  PASSIVE  vs.  ACTIVE,  or  DECLARATIVE  vs.  QUESTION. 
The  realization  rules  would  then  describe  the  exact  word  order 
used  to  signal  these  features. 

What  does  this  theory  give  us  to  use  In  a  language 
understanding  program?  What  kinds  of  parsings  does  It  produce? 
If  we  look  at  a  typical  parsing  by  a  systemic  grammar,  we  note 
several  points.  First,  It  Is  very  close  to  the  surface 
structure  of  the  sentence.  There  Is  no  rearrangement  Into 
supposed  "underlying"  forms.  Instead,  each  constituent  Is 
marked  with  features  Indicating  Its  structure  and  function. 
Instead  of  saying  that  "Did  John  go?"  has  an  underlying 
structure  which  looks  like  "John  went.",  we  simply  note  that  It 
has  the  features  QUESTION  and  YES-NO,  and  that  the  noun  group 
"John"  has  the  function  SUBJECT.  Other  parts  of  the  language 
understanding  process  do  not  have  to  be  concerned  with  the  exact 
way  the  parsing  tree  Is  structured,  since  they  can  deal  directly 
with  the  relevant  features  and  functions. 
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V^hat  Is  more  Important  Is  that  these  features  are  not 
Tt-ndom  unrelated  lists  of  observations.  They  are  part  of  a 
highly  structured  network,  and  the  grammatical  theory  Includes  a 
description  of  that  network.  When  we  do  semantic  analysis,  we 
are  not  faced  with  the  task  of  Inventing  "projection  rules"  to 
deal  with  the  raw  form  of  specific  syntactic  rules.  Instead  we 
can  ask  "What  aspect  of  meaning  does  this  system  convey?"/  anri 
"What  Is  the  significance  of  this  particular  feature  within  Its 


system?". 
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Pars  Ini; 

In  Implementing  a  systemic  grammar  for  a  computer  program 
for  unde r s tand I ng  language#  we  are  concerned  with  the  process  of 
recognition  rather  than  that  of  generation.  We  do  not  begin 
with  choices  of  features  and  try  to  produce  a  sentence.  Instead 
we  are  faced  with  a  string  of  letters#  and  the  job  Is  to 
recognize  the  patterns  and  features  In  It.  We  need  the  Inverse 
of  realization  rules  --  Interpretation  rules  which  look  at  a 
pattern#  Identify  Its  structure#  and  recognize  Its  relevant 
features.  This  Interpretation  process  Is  closely  related  to 
ocher  types  of  pattern  recognition#  and  many  Interesting 
parallels  can  be  drawn  with  the  process  of  Interpreting  a  visual 
scene  (see  <V;lnograd  1969>).  The  Important  aspect  of  both  types 
of  Interpretation  Is  looking  for  symbolic  features  which  will  be 
relevant  to  understanding#  so  that  the  parsing  can  be  Integrated 
with  the  rest  of  the  understanding  process.  In  general#  this 
problem  of  Isolating  Important  features  from  complex  Information 
and  representing  them  symbolically  Is  a  central  Issue  for 
Artificial  Intelligence#  and  the  Ide?  of  a  "systemic"  parser  may 
be  of  use  In  other  areas. 

The  parsing  system  for  our  program  is  actually  an 
Interpreter  for  PROGRAMMAR#  a  language  for  writing  grammars.  It 
is  basically  a  top-down  lef t-to-r Ight  parser#  but  It  modifies 
these  properties  when  It  Is  advantageous  to  do  so.  By  writing 
In  a  language  designed  for  grammars#  we  can  express  the 
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rejiul  ar  1 1 1  es  of  language  In  a  straightforward  way/  as  ilmply  as 
In  a  syntax-directed  parser.  The  primitives  of  the  language  are  * 

these  operations  which  have  particular  relevance  to  natural 
language  and  Its  parsing. 

A  program  for  parsing  language  Is  as  much  a  "generative" 
description  of  the  language  as  Is  a  set  of  rules  for  producing 
sentences.  The  meaning  of  "generative"  In  Chomsky's  original 
sense  (<Chomsky  1957/  1965>)  Is  that  the  grammar  should 
associate  a  structural  description  to  each  permissible  sentence 
In  the  language.  A  parsing  program  does  just  that. 

By  making  the  formalism  for  grammars  a  programming 
language/  we  enable  the  grammar  to  use  special  tools  to  handle 
complex  constructions  and  Irregular  forms.  For  example/  we  can 
set  up  programs  to  define  certain  words  like  "and"/  and  "or" 
as  "demons"/  which  cause  an  Interrupt  In  the  parsing  process 
whenever  they  are  encountered  1..  the  normal  lef t-to-rlght  order/ 

In  order  to  run  a  special  program  for  conjoined  structures. 

Idioms  can  also  be  handled  using  this  "Interrupt"  concept.  In 
fact/  the  process  can  be  Interrupted  at  any  point  In  the 
sentence/  and  any  other  computations  (either  semantic  or 
syntactic)  can  be  performed  before  going  on.  These  may 
themselves  do  bits  of  parsing/  or  they  may  change  the  course  the 
basic  program  will  take  after  they  are  done. 

It  Is  paradoxical  that  linguistic  workers  familiar  with 
computers  have  generally  not  appreciated  the  Importance  of  the 
"control"  aspect  of  programming/  and  have  not  used  the  process- 
describing  potentialities  of  programming  for  their  parsing 
theories.  They  have  Instead  restricted  themselves  to  the 
narrowest  kinds  of  rules  and  transformations  --  as  though  a  ’* 

programmer  were  to  stick  to  such  simple  models  as  Turing 
machines  or  Post  productions.  Designers  of  computer  languages 
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today  show  this  same  tendency!  See  Minsky's  remark  In  his 
Turing  lecture  <Mlnsky  1970>.  Our  parser  uses  semantic  guidance 
at  all  points,  looking  for  a  meaningful  parsing  of  the  sentence 
rather  than  trying  all  of  the  syntactic  possibilities.  Section 
2.2  describes  PROGRAMMAR  in  detail,  and  2.3  gives  a  sample 
grammar  for  English.  Section  2.4  explains  programming  details, 
and  shows  how  the  special  features  of  the  language  are  actually 
used  to  handle  specific  linguistic  problems. 
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2,2  A  Description  of  PROGRAMMAR 
2.2.1  Grammar  and  Computers 

In  order  to  explain  the  features  of  PROGRAMMAR,  we  will 
summarize  some  of  the  principles  of  grammar  used  In  computer 
language  processing.  The  basic  form  of  most  grammars  Is  a  list 
(ordered  or  unordered)  of  "replacement  rules,"  which  represent  a 
processs  of  sentence  generation.  Each  rule  states  that  a 
certain  string  of  symbols  (Its  left  side)  can  be  replaced  by  a 
different  set  of  symbols  (Its  right  side).  These  symbols 
Include  both  the  actual  symbols  of  the  language  (called  terminal 
symbols)  and  additional  "non-terminal"  symbols.  One  non¬ 
terminal  symbol  Is  designated  as  a  starting  symbol,  and  a  string 
of  terminal  symbols  Is  a  sentence  If  and  only  If  It  can  be 
derived  from  the  starting  symbol  through  successive  application 
of  the  rules.  For  example  we  can  write  Grammar  1; 

1.1  S  ->  NP  VP 

1.2  NP  ->  DETERMINER  NOUN 

1.3  VP  ->  VERB/INTRANSITIVE 

1.4  VP  ••>  VERB/TRANSITIVE  NP 

1.5  DETERMINER  ->  the 

1.6  NOUN  ->  giraffe 

1.7  NOUN  ->  apple 

1.8  VERB/ INTRANSITIVE  ->  dreams 

1.9  VERB/TP.AMSITIVE  ->  eats 

Figure  20  —  GRAMMAR  1 

By  starting  with  S  and  applying  the  list  of  rules  (1.1  1.2 
1.5  1.6  1.4  1.2  1.7  1.5  1.9),  we  get  the  sentence  "The  giraffe 
eats  the  apple."  Several  things  are  noteworthy  here.  This  Is  an 
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unordered  set  of  rules.  Each  rule  can  be  applied  any  number  of 
times  at  any  point  In  the  derivation  where  the  symbol  appears. 

In  addition#  each  rule  Is  optional.  We  could  just  as  well  have 
reversed  the  applications  of  1.6  and  1.7  to  get  "The  apple  eats 
the  giraffe."#  or  have  used  1.3  and  1.8  to  get  "The  giraffe 
dreams."  This  type  of  derivation  can  be  represented  graphically 
as : 


We  will  call  this  the  parsing  tree  for  the  sentence#  and 
use  the  usual  terminology  for  trees  (node#  subtreee#  daughter# 
parent#  etc.).  In  addition  we  will  use  the  linguistic  terms 
"phrase"  and  "constituent"  Interchangeably  to  refer  to  a 
subtree.  This  tree  represents  the  "Immediate  co'«3tl  tuent" 
structure  of  the  sentence.  The  PROGRAMMAR  language  Is  a  general 
parsing  system  which#  although  oriented  ttoward  systemic 
grammar#  can  be  used  to  parse  grammars  based  on  other  theories. 
In  descrbing  PROGRAMMAR  we  have  used  a  more  conventional  sat  of 
notations  and  analysis  of  English  In  order  to  make  the 
description  independent  of  the  work  presented  In  later  sections. 
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2. 2. ,2  CQnJL£Xt-f  ree  aM  Context-sens  I  tlve  Q.Lafnmg  r.£ 

Grammar  1  I s  an  oxample  of  what  Is  called  a  context-free 
grammar.  The  left  side  of  each  rule  consists  of  a  single 
syi.ibol  ,  and  the  indicated  replacement  can  occur  whenever  that 
symbol  Is  encountered.  There  are  a  great  number  of  different 
forms  of  grammar  which  can  be  shown  to  be  equivalent  to  this 
one/  In  that  they  can  characterize  the  same  languages.  It  has 
been  pointed  out  that  they  are  not  theoretically  capable  of 
expressing  the  rules  of  English,  to  produce  such  sentences  as, 
"John,  Sidney,  and  Chan  ordered  an  eggroll,  a  ham  sandwich,  and 
a  bagel  respective!  y.**  Much  more  Important,  even  though  they 
could  theoretically  handle  the  bulk  of  the  English  language, 
they  cannot  do  this  at  all  efficiently.  Consider  the  simple 
problem  of  subject-verb  agreement.  We  would  Ilk.  a  grammar 
which  gene-ates  "The  giraffe  dreams."  and  "The  giraffes  dream.", 
but  not  "The  giraffe  dream."  or  "The  giraffes  dreams.".  In  a 
context-free  grammar,  we  can  do  this  by  Introducing  two  starting 
symbols,  S/PL  and  S/SG  for  plural  and  singular  respectively, 
then  duplicating  each  rule  to  match.  For  example,  we  would 
have: 


1.1.1 

1.1.2 

S/PL  ->  N6/PL  VP/PL 
S/SG  ->  NG/SG  VP/SG 

1.2.1 

NG/PL  -> 

DETERMINER 

NOUN/PL 

1.2.2 

•  •  « 

NG/SG  -> 

DETERMINER 

NOUN/SG 

1.6.1 

NOUN/PL  -> 

gl raf fes 

1.6.2 

etc . 

NOUN/SG  -> 

gl raf fe 
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If  we  then  wish  to  handle  the  difference  between  "I  am", 

"he  Is",  etc.  we  must  Introduce  an  entire  new  set  of  symbols 
for  first-person.  This  sort  of  duplication  propagates 
mu  1 1 1 p 1  I ca 1 1  VC  1 y  through  the  grammar,  and  arises  In  all  sorts  of 
cases.  For  example,  a  question  and  the  corresponding  statement 
will  have  much  In  common  concerning  their  subjects,  objects 
verbs,  etc.,  but  In  a  context-free  grammar,  they  will  In  general 
be  expanded  through  two  entirely  different  sets  of  symbols. 

One  way  to  avoid  this  problem  Is  to  use  context-sensitive 
rules.  In  these,  the  left  side  may  Include  several  symbols,  and 
the  replacement  occurs  when  that  combination  of  symbols  occurs 
In  the  string  being  generated. 
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Systemic  Grammar 

Vie  can  add  power  to  our  grammar  with  context-sens  I  tve  rules 
which,  for  example.  In  expanding  the  symbol  VERS/ I NTRANS I T I VE, 
look  to  the  preceding  symbol  to  decide  whether  It  Is  singular  or 
plural.  By  using  such  context-sensitive  rules,  we  can 
characterize  any  language  whose  sentences  can  be  listed  by  a 
deterministic  (possibly  neverending)  process.  (I.e.  they  have 
the  power  of  a  turing  machine).  There  Is  however  a  problem  In 
Implementing  these  rules.  In  any  but  the  simplest  cases,  the 
context  will  not  be  as  obvious  as  In  the  simple  example  given. 
The  choice  of  replacements  will  not  depend  cn  a  single  word,  but 
may  depend  In  a  complex  way  on  the  entire  structure  of  the 
sentence.  Such  dependencies  cannot  be  expressed  In  our  simple 
rule  format,  and  new  types  of  rules  must  be  developed. 
Transformational  grammar  solves  this  by  breaking  the  generation 
process  down  Into  the  context-free  base  grammar  which  produces 
"deep  structure"  and  a  set  of  transformations  which  then  operate 
on  this  structure  to  produce  the  actual  "surface  structure"  of 
the  grammatical  sentence.  We  will  not  go  Into  the  details  of 
transformational  grammar,  but  one  basic  Idea  Is  this  separation 
of  the  complex  aspects  of  language  Into  a  separate 
transformational  phase  of  the  generation  process. 

Systemic  grammar  Introduces  context  In  a  more  unified  way 
Into  the  Immediate-constituent  generation  rules.  This  Is  done 
by  Introducing  "features"  associated  with  constituents  at  every 
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level  of  the  parsing  tree.  A  rule  of  the  grammar  may  depend/ 
for  example,  on  whether  a  particular  clause  Is  transitive  or 
intransitive.  In  the  examples  "Fred  found  a  frog.",  "A  frog  was 
found  by  Fred.",  and  "V/hat  did  Fred  find?",  all  are  transitive, 
but  the  outward  forms  are  quite  different.  A  context-sensitive 
rule  which  checked  for  this  feature  directly  In  the  string  being 
generated  would  have  to  be  quite  complex.  Instead,  we  can  allow 
each  symbol  to  have  additional  subscripts,  or  features  which 
control  Its  expansion.  In  a  way,  this  Is  like  the  separation  of 
the  symbol  NP  Into  NP/PL  and  NP/SG  In  our  augmented  context-free 
grammar.  But  It  Is  not  necessary  to  develop  whole  new  sets  of 
symbols  with  a  set  of  expansions  for  each.  A  symbol  such  as 
CLAUSE  may  be  associated  with  a  whole  set  of  features  (such  as 
TRANSITIVE,  QUESTION,  SUBJUNCTIVE,  OBJECT-QUESTION,  etc.)  but 
there  Is  a  single  set  of  rules  for  expanding  CLAUSE.  These 
rules  may  at  various  points  depend  on  the  set  of  features 
present. 

The  power  of  systemic  grammar  rests  on  the  observation  that 
the  context-dependency  of  natural  language  is  centered  around 
clearly  defined  and  highly  structured  sets  of  features,  so 
through  their  use  a  great  deal  of  complexity  can  be  handled  very 
economically.  More  Important  for  our  purposes,  there  Is  a  high 
correlation  between  these  features  and  the  semantic 
interpretation  of  the  constituents  which  exhibit  them.  They 
cannot  be  put  In  a  one-to-one  correspondence  with  semantic 
properties  of  the  phrases  In  which  they  appear,  but  are  a 
tremendous  aid  to  Interpretation. 

A  parsing  of  a  sentence  In  a  systemic  grammar  might  look 
very  much  like  a  context-free  parsing  tree,  except  that  to  each 
node  would  be  attached  a  number  of  features.  These  features  are 
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not  random  combinations  of  facts  about  the  constituent/  but  are 
a  part  of  a  carefully  worked  out  analysis  of  a  language  in  terms 
of  Its  "systems".  The  features  are  organized  In  a  network,  with 
clearly  organized  dependencies.  For  example,  the  features 
IMPERATIVE  (command)  and  QUESTION  are  mutually  exclusive  In  a 
clause,  as  are  the  features  YES-NO  (yes-no  question  like  "Did  he 
go?")  and  WH-  question  (like  "Who  went?).  In  addition,  the 
second  choice  can  be  made  only  If  the  choice  QUESTION  was  made 
In  the  first  set.  A  set  of  mutually  exclusive  features  Is 
called  a  "system",  and  tne  set  of  other  features  which  must  be 
present  for. the  choice  to  be  possible  Is  called  the  "entry 
condition"  for  that  system.  This  Is  discussed  In  detail  In 
section  2,3. 

Another  basic  concept  of  systemic  grammar  Is  that  of  the 
rank  of  a  constituent.  Rather  than  having  a  plethora  of 
different  non-terminal  symbols,  each  expanding  a  constituent  In 
a  slightly  different  way,  there  are  only  a  few  basic  "units", 
each  having  the  possibility  of  a  number  of  different  features, 
chosen  from  the  "system  network"  for  that  unit.  In  an  analysis 
of  English,  three  basic  units  seem  to  explain  the  structure:  the 
CLAUSE,  the  GROUP,  and  the  WORD.  In  general,  clauses  are  made 
up  of  groups,  and  groups  made  up  of  words.  However,  through 
"ranksh I f t",  clauses  or  groups  can  serve  as  constituents  of 
other  clauses  or  groups.  Thus,  In  the  sentence  "Sarah  saw  the 
student  sawing  logs."  "the  student  sawing  logs"  Is  a  NOUN  GROUP 


Section  2.2.3  -  Page  76 


with  the  CLAUSE  "sawing  logs"  as  a  constituent  <a  modifier  of 
*  "student")  . 

The  constituents  "who"/  "three  days"/  "some  of  the  men  on 
the  board  of  directors/"  and  "anyone  who  doesn't  understand  me" 
are  all  noun  groups/  exhibiting  different  features.  This  means 
that  a  PROGRAMMAR  grammar  will  have  only  a  few  programs/  one  to 
deal  with  each  of  the  basic  units.  Our  current  grammar  of 
English  has  programs  for  the  units  CLAUSE/  NOUN  GROUP/  VERB 
GROUP/  PREPOSITION  GROUP/  and  ADJECTIVE  GROUP. 
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IjJLJl  Granrmjrs  5A  Programs 

Earlier  we  pointed  out  that  a  complete  generative 
description  of  a  language  can  be  In  the  form  of  a  program  for 
parsing  It.  For  simple  grammars,  there  Is  a  close 
correspondence  between  the  parsing  program  and  the  usual 
generation  rules. 

We  can  think  of  a  grammar  as  a  set  of  Instructions  for 
parsing  a  sentence  In  the  language.  A  rule  like:  NP  -> 
DETERMINER  NOUN  can  be  Interpreted  as  the  Instruction  "If  you 
want  to  find  a  NP,  look  for  a  DETERMINER  followed  by  a  NOUN." 
Grammar  1  could  be  diagrammed  as  shown  In  Figure  22. 

The  basic  function  used  Is  PARSE,  a  function  which  tries  to 
add  a  constituent  of  the  specified  type  to  the  parsing  tree.  If 
the  type  has  been  defined  as  a  PROGRAMMAR  program,  PARSE 
activates  the  program  for  that  unit,  giving  It  as  Input  the  part 
of  the  sentence  yet  to  be  parsed  and  (optionally)  a  list  of 
Initial  features.  If  no  definition  exists,  PARSE  Interprets  Its 
arguments  as  a  list  of  features  which  must  be  found  In  the 
dictionary  definition  of  the  next  word  In  the  sentence.  If  so. 

It  attaches  a  node  for  that  word,  and  removes  It  from  the 
remainder  of  the  sentence.  If  not.  It  falls.  If  a  PROGRAMMAR 
program  has  been  called  and  succeeds,  the  new  node  Is  attached 
to  the  parsing  tree.  If  I'  falls,  the  tree  Is  left  unchanged. 


Sect  Ion  2 . 2  .  U 
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DEFINE  Drop:rom  SENTENCE 


RETURN  success 


'll' 

RETURN  success 


DEFINE  program  VP 


Figure  22  — Simple  Parsing  Program 
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Z.»1a1  FP-Tffl  Pf  PROGRAMMAR  Grammars 


Written  In  PROGRAMMAR,  the  progra'^is  v.'ould  look  like: 


2.1 

(PDEFINE 

SENTENCE 

2.2 

(((PARSE 

NP)  NIL  FAIL) 

2.3 

((PARSE 

VP)  FAIL  FAIL  RETURN))) 

2.4 

(PDEFINE 

NP 

2.5 

(((PARSE 

DETERMINER)  NIL  FAIL) 

2.6 

((PARSE 

NOUN)  RETURN  FAIL))) 

2.7 

(PDEFINE 

VP 

2.8 

(((PARSE 

VERB)  NIL  FAIL) 

2.9 

(dSQ  H 

TRANSITIVE)  NIL  INTRANS  ) 

2.10 

((PARSE 

NP)  RETURN  NIL) 

2.11 

INTRANS 

2.12 

(dSQ  H 

INTRANSITIVE)  RETURN  FAIL))) 

Rules 

1.6  to  : 

1.9  would  have  the  form: 

2.13 

(DEFPROP 

GIRAFFE  (NOUN)  WORD) 

2.14 

(DEFPROP 

etc. 

DREAM  (VERB  INTRANSITIVE)  WORD) 

Figure  23  —  Grammar  2 

This  example  Illustrates  some  of  the  basic  features  of 
PROGRAMMAR.  First  It  Is  embedded  In  LISP,  and  much  of  Its 
syntax  Is  LISP  syntax.  Units,  such  as  SENTENCE  are  defined  as 
PROGRAMMAR  programs  of  nr  arguments.  Each  tries  to  parse  the 
string  of  words  left  to  be  parsed  In  the  sentence.  The  exact 
form  of  this  Input  string  Is  described  In  section  2.4.8.  The 
value  of  (PARSE  SENTENCE)  will  be  a  list  structure  corresponding 
to  the  parsing  tree  for  the  complete  sentence. 

Each  time  a  call  Is  made  to  the  function  PARSE,  the  system 
begins  to  build  a  new  node  on  the  tree.  Since  PROGRAMMAR 

programs  can  call  each  other  recursively.  It  Is  necessary  to 
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keep  a  pushdown  list  of  nodes  which  are  not  yet  completed  (t.e. 
the  entire  rightmost  branch  of  the  tree).  These  are  a’l  called 
"active"  nodes/  and  the  one  formed  by  the  most  recent  call  to 
PARSE  Is  called  the  "currently  active  node". 

We  can  examine  our  sample  program  to  see  the  basic 
operation  of  the  language.  Whenever  a  PROGRAMMAR  program  Is 
called  directly  by  the  user#  a  node  of  the  tree  structure  Is  set 
up,  and  a  set  of  special  variables  are  bound  (see  section 
2.4.9).  The  lines  of  the  program  are  then  executed  In  sequence, 
as  In  a  LISP  PROG,  except  when  they  have  the  special  form  of  a 
BRANCH  statement  (a  list  whose  first  member  (the  CONDITION)  Is 
non-atomlc,  and  which  has  either  2  or  3  other  membe.'S,  called 
DIRECTIONS).  Line  2.3  of  GRAMMAR  2  Is  a  three-direction  branch, 
and  all  the  other  executable  lines  of  the  program  are  two- 
dlrectlon  branches, 

Vlhen  a  branch  statement  Is  encountered,  the  condition  Is 
evaluated,  and  branching  depends  on  Its  value.  In  a  two- 
dlrectlon  branch,  the  first  direction  Is  taken  If  It  evaluates 
to  non-NlL,  the  second  direction  If  It  Is  NIL.  In  a  three- 
direction  branch,  the  first  direction  Is  taken  only  If  the 
condition  Is  non-NIL,  and  there  Is  more  of  the  sentence  to  be 
parsed.  If  no  more  of  the  sentence  remains,  and  the  condition 
evaluates  non-NIL,  the  third  direction  Is  taken. 

The  directions  can  be  of  three  types.  First,  there  are 
three  reserved  words,  NIL,  RETURN,  and  FAIL.  A  direction  of  NIL 
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sends  evaluation  to  the  next  statement  In  the  program.  FAIL 
causes  the  program  to  return  MIL  after  restoring  the  sentence 
oi.d  the  parsing  tree  to  their  state  before  that  program  wes 
called.  RETURN  causes  the  program  to  attach  the  currently 
active  node  to  the  completed  parsing  tree  and  return  the  subtree 
below  that  node  as  Its  value. 

If  the  direction  Is  any  other  atom#  It  acts  as  a  GO 
statement/  transferring  evaluation  to  the  statement  Immediately 
following  the  occurence  of  that  atom  as  a  tag.  For  example/  If 
a  failure  occurs  In  line  2.9/  evaluation  continues  with  line 
2.12,  If  the  direction  Is  non-atomic/  the  result  Is  the  same 
as  a  FAIL/  but  the  direction  Is  put  on  a  special  failure  message 
list/  so  the  calling  program  can  see  the  reason  for  failure. 
DIRECTION'S  can  also  be  used  In  the  function  GOCOND.  The 
statment  (GOCOND  TAGl  TAG2)  causes  the  program  to  go  to  TAGl  If 
tnere  are  words  left  to  be  parsed/  and  to  TAG2  otherwise. 

Looking  at  the  programs/  we  see  that  SENTENCE  will  succeed 
only  If  It  first  finds  a  NP/  then  finds  a  VP  which  uses  up  the 
rest  of  the  sentence.  In  the  program  VP/  vve  see  that  the  first 
branch  statement  checks  to  see  whether  the  next  word  Is  a  verb. 
If  SO/  It  removes  It  from  the  remaining  sentence/  and  goes  on. 

If  not/  VP  falls.  The  second  statement  uses  the  PROGRAMMAR 
function  ISQ/  one  of  the  functions  used  for  checking  features. 
(ISO  A  B)  checks  to  see  whether  the  node  or  word  pointed  to  by  A 
has  the  feature  B.  H  Is  one  of  a  number  of  special  variables 
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used  to  hold  Information  associated  with  a  node  of  the  parsing 
tree,  xsee  section  2.U.9)  It  points  to  the  last  word  or 
ccnstituent  parsed  by  that  program.  Thus  the  condition  (ISQ  H 
TRANSITIVE)  succeeds  only  If  the  verb  just  found  by  PARSE  has 
the  feature  TRANSITIVE.  If  so,  the  direction  NIL  sends  It  on  to 
the  next  statement  to  look  for  a  NP,  and  If  It  finds  one  It 
returns  success.  If  either  no  such  NP  Is  found  or  the  verb  Is 
not  TRANSITIVE,  control  goes  to  the  tag  INTRANS,  and  If  the  verb 
Is  INTRANSITIVE,  the  program  VP  succeeds.  Note  that  a  verb  can 
have  both  the  features  INTRANSITIVE  and  TRANSITIVE,  and  the 
parsing  will  then  depend  on  whether  or  not  an  object  NP  Is 


found 
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Context-Sensitive  Aspects 

So  far#  we  have  done  little  to  go  beyond  a  context-free 
grammar.  How,  for  example,  can  we  handle  agreement?  One  way  to 
•’ ■)  this  would  be  for  the  VP  program  to  look  back  In  the  sentence 
for  the  subject,  and  check  Its  agreement  with  the  verb  before 
going  on.  We  need  a  way  to  climb  around  on  the  parsing  tree, 
looking  at  Its  structure.  In  PROGRAMMAR,  this  Is  done  with  the 
pointer  PT  and  the  moving  function  *. 

Whenever  the  function  *  Is  called.  Its  arguments  form  a 
list  of  Instructions  for  moving  PT  from  Its  present  po>ItIor. 
These  Instructions  can  be  quite  general,  saying  things  like 
"Move  left  until  you  find  a  unit  with  feature  X,  then  up  until 
you  find  a  CLAUSE,  then  down  to  Its  last  cons t I tutent,  and  left 
until  you  find  a  unit  meeting  the  arbitrary  condition  Y."  The 
Instruction  list  contains  non-atomic  CONDITIONS  and  atomic 
INSTRUCTIONS.  The  Instructions  are  taken  In  order,  and  when  a 
condition  ,  Is  encountered,  the  preceding  Instruction  Is  evaluated 
repeatedly  until  the  condition  Is  satisfied.  If  the  condition 
Is  of  the  form  (ATOM),  It  Is  satisfied  only  If  the  node  pointed 
to  by  PT  has  the  feature  ATOM.  Any  other  condition  Is  evaluated 
by  LISP,  and  I s  satisfied  If  It  returns  a  non-NIL  value. 

Section  2.4.10  lists  the  Instructions  for  *. 

For  example,  evaluating  (*  C  U)  will  set  the  pointer  to  the 
parent  of  the  currently  active  node.  (The  mnemonics  are: 

Current,  Up)  The  call  (*  C  DLC  PV  (NP))  will  start  at  the 


I 
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current  node,  move  down  to  the  rightmost  completed  node  (l.e. 
not  currently  active)  then  move  left  until  It  finds  a  node  with 
the  feature  NP.  (Oown-Last-Completed,  Previous).  If  *  succeeds. 
It  returns  the  new  value  of  PT  and  leaves  PT  set  to  that  value. 
If  It  falls  at  any  point  In  the  list,  because  the  existing  tree 
structure  makes  a  command  Impossible,  or  because  a  condition 
cannot  be  satisfied,  PT  Is  left  at  Its  original  position,  and  ♦ 
returns  NIL. 

We  can  now  add  another  branch  statement  to  the  VP  program 
In  section  2.2.5  between  lines  2.8  and  2.9  as  follov/s: 

2.8.1  ((0R(AND(  ISQ(*  C  PV  DLOS  I NGULAR)  (  1  SQ  H  SINGULAR)) 

2.8.2  (ANDdSQ  PT  PLURADdSQ  H  PLURAL))) 

2.8.3  NIL  (AGREEMENT)) 

This  Is  an  example  of  a  branch  statement  with  an  error 
message.  It  moves  the  pointer  from  the  currently  active  node 
(the  VP)  to  the  previous  node  (the  NP)  and  down  to  Its  last 
contituent  (the  noun).  It  then  checks  to  see  whether  this 
shares  the  feature  SINGULAR  with  the  last  constituent  parsed  by 
VP  (the  verb).  If  not  It  checks  to  see  whether  they  share  the 
feature  PLURAL.  Notice  that  once  PT  has  been  set  by  *,  It 
remains  at  that  position.  If  agreement  Is  found,  evaluation 
continues  as  before  with  line  2.9.  If  not,  the  program  VP  falls 
with  the  message  (AGREEMENT). 

So  far  we  have  not  made  much  use  of  features,  except  on 
words.  As  the  grammar  gets  more  complex,  they  become  much  more 
Important.  As  a  simple  example,  we  may  wish  to  augment  our 
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grammar  to  accept  the  noun  groups  "these  fish/"  "this  fish/" 
"the  glraffeS/"  and  "the  giraffe/"  but  not  "these  giraffe/"  or 
"this  giraffes."  We  can  no  longer  check  a  single  word  for 
agreement/  since  "fish"  gives  no  clue  to  number  In  the  first 
twO/  while  "the"  gives  no  clue  In  the  third  and  fourth.  Mumber 
Is  a  feature  of  the  entire  noun  grouP/  and  we  must  Interpret  It 
In  some  cases  from  the  form  of  the  noun/  and  In  others  from  the 
form  of  the  determiner. 

We  can  rewrite  our  programs  to  handle  this  complexity  as 
shown  In  Grammar  3: 


3.1  (PDEFINE  SENTENCE 

3.2  (((PARSE  N3)NIL  FAIL) 

3.3  ((PARSE  VP)  FAIL  FAIL  RETURN))) 


3.4  (PDEFINE  NP 

3.5  (((AND(PARSE  DETERM  I NER) (FQ  DETERM  I  NED) ) N I L  NIL  FAIL) 

3.6  ((PARSE  NOUN)NIL  FAIL) 

3.7  ((CQ  DETERMINED)DET  NIL) 

3.8  ((AND(*  H)(TRNSF  (QUOTE(S INGULAR  PLURAL) ))) RETURN  FAIL) 

3.9  DET 

3.10  ((TRNSF  (MEET(FE(*  H  PV  (DETERMINER))) 

3.11  (QU0TE(S INGULAR  PLURAL)))) 

3.12  RETURN 

3.13  FAIL))) 

3.14  (PDEFINE  VP 

3.15  (((PARSE  VERB)NIL  FAIL) 

3.16  ((MEET(FE  H)(FE(*  C  PV  (NP) ) ) (QU0TE(S 1 NGULAR  PLURAL))) 

3.17  NIL 

3.18  (AGREEMENT)) 

3.19  (dSQ  H  TRANSITIVE)NIL  >NTP,ANS) 

3.20  ((PARSE  NP) RETURN  NIL) 

3.21  (dSQ  H  INTRANSITIVE)RETURN  FAIL))) 

Figure  24  --  Grammar  3 


Section  2.?. 6  -  Page  86 


We  have  used  the  PROGRAMMAR  functions  FQ  and  TRNSF/  which 
attach  features  to  constituents.  The  effect  of  evaluating  (FQ 
A)  Is  to  add  the  feature  A  to  the  list  of  features  for  the 
currently  active  node  of  the  parsing  tree.  TRNSF  Is  used  to 
transfer  features  from  the  pointer  to  the  currently  active  node. 
Its  argument  Is  a  list  of  features  to  be  looked  for.  For 
example^  line  3.8  looks  for  the  features  SINGULAR  and  PLURAL  In 
the  last  constituent  parsed  (the  NOUN),  and  adds  whichever  ones 
It  finds  to  the  currently  active  node.  The  branch  statement 
beginning  with  line  3.10  Is  more  complex.  The  function  *  finds 
the  DETERMINER  of  the  NP  being  parsed.  The  function  FE  finds 
the  list  of  features  of  this  node,  and  the  function  MEET 
intersects  this  with  the  list  of  features  (SINGULAR  PLURAL). 

This  Intersection  Is  then  the  set  of  allowable  features  to  be 
transferred  to  the  NP  node  from  the  NOUN.  Therefore  If  there  Is 
no  agreement  beween  the  NOUN  and  the  DETERMINER,  TRNSF  falls  to 
find  any  features  to  transfer,  and  the  resulting  failure  causes 
the  rejection  of  such  phrases  as  "these  giraffe." 

In  line  3.7  we  use  the  function  CQ  which  checks  for 
features  on  the  current  node.  (CQ  DETERMINED)  will  be  non-NIL 
only  If  the  current  node  has  the  feature  DETERMINED,  (l.e.  It 
was  put  there  In  line  3.5)  Therefore,  a  noun  group  with  a 
determiner  Is  marked  with  the  feature  DETERMINED,  and  Is  also 
given  features  corrresponding  to  the  Intersection  of  the  number 
features  associated  with  the  determiner  If  there  Is  one,  and  the 
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noun.  Notice  that  this  gr^immar  can  accept  noun  groups  without 
determiners,  as  In  "Giraffes  eat  apples."  since  line  3.5  falls 
only  If  a  DETERMINER  Is  found  and  there  are  no  more  words  In  the 
sentence. 

In  conjunction  with  the  change  to  the  MP  program,  the  VP 
program  must  be  modified  to  check  with  the  NP  for  agreement. 

The  branch  statement  beginning  on  Line  3.16  does  this  by  making 
sure  there  Is  a  number  feature  common  to  both  the  subject  and 
the  verb. 

This  brief  description  explains  some  of  the  basic  features 
of  PROGRAMMAR.  In  a  simple  grammar,  their  Importance  Is  not 
obvious,  and  Indeed  there  seem  to  be  easier  ways  to  achieve  the 
same  effect.  As  grammars  become  more  complex,  the  special 
aspects  of  PROGRAMMAR  become  more  and  more  Important.  The 
flexibility  of  writing  a  grammar  as  a  program  Is  needed  both  to 
handle  the  complexities  of  English  syntax,  and  to  combine  the 
semantic  analysis  of  language  with  the  syntactic  analysis  In  an 
Intimate  way.  Section  2.3  describes  a  fairly  complex  grammar  of 
English,  and  section  4.2  describes  the  way  It  Is  Integrated  v^lth 
the  semantic  programs.  A  number  of  the  other  features  and 
details  of  f^ROGRAMMAR  are  described  In  Section  2.4. 
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2.2.7  Ambiguity  jLOi  Understanding 

Readers  familiar  with  parsing  systems  may  by  now  have 
v/ondered  about  the  problem  of  ambiguity.  As  explained,  a 
PROGRAMMAR  program  tries  to  find  a  possible  parsing  for  a 
sentence,  and  as  soon  as  It  succeeds.  It  returns  Its  answer. 

This  Is  not  a  defect  of  the  system,  but  an  active  part  of  the 
concept  of  language  for  which  It  was  designed.  The  language 
process  Is  not  segmented  Into  the  operation  of  a  parser, 
follov/ed  by  the  operation  of  a  semantic  Interpreter.  Rather, 
the  process  Is  unified,  with  the  results  of  semantic 
Interpretation  being  used  to  guide  the  parsing.  This  Is  very 
difficult  In  other  forms  of  grammar,  with  their  restricted  types 
of  context-dependence.  But  It  Is  straightforward  to  Implement 
In  PROGRAMMAR.  For  example,  the  last  statement  In  a  program  for 
NP  may  be  a  call  to  a  noun-phrase  semantic  Interpreter.  If  I" 

Is  Impossible  to  Interpret  the  phrase  as  It  Is  found,  the 
parsing  Is  Immediately  redirected. 

The  way  of  treating  ambiguity  Is  not  through  listing  all 
1241  possible  Interpretations  of  a  sentence,  but  in  being 
intelligent  In  looking  for  the  first  one,  and  being  even  more 
Intelligent  In  looking  for  the  next  one  If  that  falls.  There  Is 
no  automatic  backup  mechanism  In  PROGRAMMAR,  because  blind 
automatic  backup  Is  tremendously  Inefflcent.  A  good  PROGRAMMAR 
program  will  check  Itself  when  a  failure  occurs,  and  based  on 
the  structures  It  has  seen  and  the  reasons  for  the  failure.  It 
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wM ]  decide  specifically  what  should  be  tried  next.  This  Is  the 
reason  for  Internal  fal lure-messsages#  and  there  are  facilities 
for  performing  the  specific  backup  steps  necessary.  (See 
section  2,4.5) 

As  a  concrete  example,  we  might  have  the  sentence  "I  rode 
down  the  street  In  a  car."  At  a  certain  point  In  the  parsing, 
the  NP  program  may  come  up  with  the  constituent  "the  street  In  a 
car".  Before  going  on,  the  semantic  analyzer  will  reject  the 
phrase  "In  a  car"  as  a  possible  modifier  of  "street",  and  the 
program  will  attach  It  Instead  as  a  modifier  of  the  action 
represented  by  the  sentence.  Since  the  semantic  system  Is  a 
part  of  a  complete  deductive  understander,  with  a  definite 
world-model,  the  semantic  evaluation  which,  guides  parsing  can 
Include  both  general  knov/ledge  (cars  don't  contain  streets)  and 
specific  knowledge  (Melvin  owns  a  red  car,  for  example).  Humans 
take  advantage  of  this  sort  of  knowledge  In  their  understanding 
of  language,  and  It  has  been  pointed  out  by  a  number  of 
linguists  and  computer  scientists  that  good  computer  handling  of 
language  will  not  be  possible  unless  computers  can  do  so  as 
well.  ' 

Few  sentences  seem  ambiguous  to  humans  when  first  read. 

They  are  guided  by  an  understanding  of  what  Is  said  to  pick  a 
single  parsing  and  a  very  few  different  meanings.  By  using  this 
same  knowledge  to  guide  Its  parsing,  a  computer  understanding 
system  can  take  advantage  of  the  same  technique  to  parse 
meaningful  sentences  quickly  and  efficiently. 

We  must  be  careful  to  distinguish  between  grammatical  and 
semantic  ambiguity.  Although  we  want  to  choose  a  single  parsing 
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without  considering  the  alternatives  simultaneously,  we  want  to 
handle  semantic  ambiguity  very  differently.  There  may  be 
several  Interpretations  of  a  sentence  which  are  all  more  or  less 
meaningful,  and  the  choice  between  them  will  depend  on  a  complex 
evaluation  of  our  knowledge  of  the  world,  of  the  knowledge  the 
person  speaking  has  of  the  world,  and  of  what  has  been  said 
recent  1 y . 


Section  2.2.8  -  Page  91 


2.2.8  SufTimarv 

In  understanding  the  reason  for  developing  PROGRAMMAR, 
several  factors  are  Important.  The  ^Irst  Is  that  only  through 
the  flexibility  of  expressing  a  grammar  as  a  program  can  we 
Introduce  the  type  of  intelligence  necessary  for  complete 
language  understanding.  PROGRAMMAR  Is  able  to  take  Into  account 
the  fact  that  language  Is  structured  In  order  to  convey  meaning, 
and  that  our  parsing  of  sentences  depends  Intimately  on  our 
understanding  that  meaning.  PROGRAMMAR  can  take  advantage  of 
this  to  deal  more  efficiently  with  natural  language  than  a 
general  rule-based  system,  whether  context-free  or 
transformational.  More  Important,  the  analysis  returned  by 
PROGRAMMAR  Is  designed  to  serve  as  a  part  of  a  total 
understanding  process,  and  to  lend  Itself  directly  to  semantic 
Interpretatic  ).  This  was  one  reason  for  selecting  systemic 
grammar,  and  has  guided  much  of  the  design  of  the  system.  The 
exact  way  In  which  semantic  Interpretation  can  be  done,  and  the 
reasons  why  a  systemic  analysis  Is  Important  will  be  discussed 
In  sections  4.1  and  4.2. 
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2.3  A  Grammar  of  English 
2.3.1  About  ihS.  Qranimax 

This  section  describes  the  grammar  of  English  used  by  our 
system.  It  Is  based  on  the  principles  of  systemic  gramma r 
(<Hal1lday  1961,  1966a,  1966b,  1967>),  and  emphasizes  the 
analysis  of  the  significant  features  exhibited  by  lingustic 
structures,  rather  than  their  detailed  form.  Instead  of  giving 
a  set  of  grammar  rules  or  algorithms,  this  section  will  try  to 
convey  an  Impressionistic  overview  of  English  grammar,  giving 
examples  of  the  most  Important  features.  Section  2.3.11  follows 
the  parser  through  two  sample  sentences,  showing  how  the  grammar 
Is  used.  The  actual  PROGRAMMAR  grammar  contains  the  details, 
and  Is  available  on  request.  A  few  particularly  Interesting 
parts  of  the  detailed  grammar  are  described  In  section  2.U. 
Appendix  A  Is  a  glossary  of  features  with  references  to  the 
pages  on  which  they  are  Illustrated. 

Before  beginning,  several  warnings  are  In  order.  First, 
this  Is  not  by  any  means  a  complete  grammar  of  English.  The 
task  of  codifying  an  entire  language  In  any  formalism  Is  so 
large  that  It  would  be  folly  to  try  In  the  course  of  a  single 
research  project.  Our  goal  was  to  cover  a  large  enough  portion 
of  English  syntax  so  that  a  user  could  converse  comfortably  with 
the  system  about  Its  toy-block  world.  There  are  whole  areas  of 
syntax  which  are  Involved  with  conveying  Information  of  types 
not  included  In  this  narrow  field  (such  as  the  emotional 
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reaction,  mood,  and  emphasis  of  the  speaker).  These  are  not 
handled  at  all,  and  even  within  the  toy-block  world,  there  are 
numerous  sentences  and  constructions  which  the  grammar  Is  not 
yet  equipped  to  handle.  It  will  be  of  Interest  to  see  whether 
the  basic  structure  of  the  syntactic  theory  Is  flexible  enough 
to  add  the  great  amount  of  complexity  which  could  be  Included  In 
a  more  complete  grammar. 

Second,  the  grammatical  theory  Is  used  In  a  very  Impure 
way.  The  main  consideration  was  to  produce  a  working  grammar 
which  could  serve  In  a  language-understanding  program.  The 
demands  of  practicality  often  overrode  more  theoretical 
criteria,  and  the  resulting  grammar  Is  not  very  "pretty".  This 
Is  especially  true  since  It  has  evolved  In  a  continuous  process 
of  writing  and  debugging,  and  has  not  yet  undergone  the 
"polishing"  which  removes  the  traces  of  Its  earlier  stages  of 
development. 

Demands  of  time  made  It  Impossible  to  coordinate  the 
writing  of  the  grammar  with  other  current  versions  of  systemic 
grammar,  so  the  analysis  Is  non-standard,  often  disagreeing  with 
Halllday's  analysis  or  other  more  complete  versions.  Some 
differences  are  simply  notatlonal  (using  different  names  for  the 
same  thing),  others  are  Intentional  simplifications  (Halllday's 
analysis  Is  much  more  complete),  and  some  represent  actual 
theoretical  differences  (for  example,  our  analysis  of  the 
transitivity  system  puts  much  of  the  structure  Into  the  semantic 
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rather  than  syntactic  rules^  while  Halllday's  Is  more  purely 
syntactic.).  We  will  not  describe  the  differences  ir  detail., 
since  this  Is  not  a  proposal  for  a  specific  version  of  English 
grammar.  It  Is  Instead  a  proposal  for  a  way  of  looking  at 
language/  and  at  English,  pointing  out  some  of  the  Interesting 
features. 
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2.^1^  Units.  R^nk/  Features 

We  win  begin  by  describing  some  of  the  basic  concepts  of 
systemic  grammar#  before  giving  details  of  their  use  In  our 
analysis.  Some  of  the  description  Is  a  repetition  of  material 
In  Section  2.1.  In  that  section  we  needed  to  give  enough 
explanation  of  systemic  grammar  to  explain  PROGRAMMAR.  Here  we 
give  a  more  thorough  explanation  of  Its  details. 

The  first  is  the  notion  of  syntactic  uni ts  In  analyzing  the 
constituent  structure  of  a  sentence  (the  way  it  Is  built  up  cut 
of  smaller  parts).  if  we  Icok  at  other  forms  of  grammar#  we  see 
that  syntactic  structures  are  usually  represented  as  a  binary 
tree#  with  many  levels  of  branching  and  few  branches  at  any 
node.  The  tree  is  not  organized  Into  "groupings"  of  phrases 
which  are  used  for  ccnve/Ing  different  parts  of  the  meaning. 

For  example#  the  sentence  "Th<.  three  big  red  dogs  ate  a  raw 
steak."  would  be  parsed  with  something  like  the  first  tree  In 
Figure  25. 

Systemic  grammar  pays  more  attention  to  they  way  language 
is  organized  Into  units#  each  of  which  has  a  special  role  In 
conveying  meaning.  In  English  we  can  distinguish  three  basic 
ranks  of  units#  the  CLAUSE#  the  GROUP#  and  the  WORD.  There  are 
several  types  of  groups:  NOUN  GROUP  (NG)#  VERB  GROUP  <VG) 
PREPOSITION  GROUP  (PREPG)  and  ADJECTIVE  GROUP  (ADJG) .  In  a 
systemic  grammar#  the  same  sentence  might  be  viewed  as  having 
the  second  structure  In  Figure  25. 
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Figure  25  -  Parsing  T reas 
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In  this  analysis^  the  VJORD  Is  the  basic  building  block. 

There  are  word  classes  like  "adjective"  ,  "noun",  "verb",  and 
each  word  Is  an  Integral  unit  --  It  Is  not  chopped  Into 
hypothetical  bits  (like  analyzing  "dogs"  as  being  composed  of 
"dog"  and  "-s"  or  "dog"  and  "plural").  Instead  we  vlev#  each 
word  as  exhibiting  features .  The  word  "dogs"  is  the  same  basic 
vocabulary  item  as  "dog",  but  has  the  feature  "plural"  Instead 
of  "singular".  The  words  "took",  "take",  "taken",  "taking", 
etc.,  are  all  the  same  basic  word,  but  with  differing  features 
such  as  "past  participle"  (EN),  "Infinitive"  (INF),  "-ing" 

(ING),  etc.  When  discussing  features,  we  will  use  several 
notational  conventions.  Any  word  appearing  In  all  upper-case 
letters,  is  the  actual  symbol  used  to  represent  a  feature  In  our 
gram.mar  and  semantic  programs.  A  feature  name  enclosed  in 
quotes  Is  an  English  version  which  Is  more  Informative.  Usually 
the  program  version  Is  an  abbreviation  of  the  urgllsh  version, 
and  sometimes  we  vdll  Indicate  this  by  typing  the  letters  of  the 
abbreviation  in  upper-case,  and  the  rest  in  lower-case.  Thus  if 
"determiner"  Is  abbreviated  as  DET,  we  may  write  DETermlner.  We 
may  even  write  things  like  QuaNTIFleR.  When  we  want  to  be  more 
careful,  we  will  write  "quantifier"  (O.NTFR). 

The  next  larger  unit  than  the  WORD  Is  the  GROUP,  of  v^hlch 
there  are  the  four  types  mentioned  above.  Each  one  has  a 

particular  function  In  conveying  meaning.  Noun  groups  (NG)  •* 

describe  objects,  verb  groups  (V6)  carry  complex  messages  about 
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the  time  and  modal  (logical)  status  of  an  event  or  relationship, 
preposition  groups  (PREPG)  describe  simple  relationships,  while 
adjective  groups  (ADJG)  convey  other  kinds  of  relationships  and 
descriptions  of  objects.  These  semantic  functions  are  described 
In  more  detail  In  section  4.2. 

Each  GROUP  can  have  "slots”  for  the  words  of  which  It  Is 
composed.  For  example,  a  NG  has  slots  for  "determiner"  (DET), 
"numbers"  (NUM),  "adjectives"  (AOJ),  "classifiers"  (CLASP),  and 
a  NOUN.  Each  group  can  also  exhibit  features,  just  as  a  vord 
can.  A  NG  can  be  "singular"  (NS)  or  "plural"  (NPL),  "definite" 
(DEF)  as  In  "the  three  dogs"  or  "Indefinite"  (INDEF)  as  In  "a 
steak",  and  so  forth.  A  VG  can  be  "negative"  (NEG)  or  not,  can 
be  MODAL  (as  In  "could  have  seen"),  and  It  has  a  tense.  (See 
Section  2.3.8  for  an  analysis  of  complicated  tenses,  such  as  "He 
would  have  been  going  to  be  fixing  It.") 

Finally,  the  top  rank  Is  the  CLAUSE.  We  speak  of  clauses 
rather  than  sentences  since  the  sentence  Is  more  a  unit  of 
discourse  and  semantics  than  a  separate  syntactic  structure.  It 
Is  either  a  single  clause  or  a  series  of  clauses  joined  together 
In  a  simple  structure  such  as  "A  and  B  and...".  We  study  these 

conjoining  structures  separately  since  they  occur  at  all  ranks, 
and  there  Is  no  real  need  to  have  a  separate  syntactic  unit  for 
sentence. 

The  clause  Is  the  most  complex  and  diverse  unit  of  the 
language,  and  Is  used  to  express  complex  relationships  and 
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events#  Involving  time#  place#  manner  and  many  other  aspects  of 
meaning.  It  can  be  a  QUESTION#  a  DECLARATIVE#  or  an  IMPERATIVE# 
It  can  be  "passive*"  (PASV)  or  "active"  (ACTV)#  It  can  be  a  YES- 
NO  question  or  a  WH-  question  (like  *'Why...?"  or  "Which...?"). 

Looking  at  our  sample  parsing  tree#  Tree  2  In  Figure  25.  we 
see  that  the  clauses  are  made  up  of  groups#  which  are  In  turn 
made  up  of  words  However  few  sentences  have  this  simple  three- 
layer  structure.  Groups  often  contain  other  groups  (for 
example#  "the  call  of  the  wild"  is  a  NG#  which  contains  the 
PREPG  "of  the  wild"  which  in  turn  contains  the  MG  "the  wild"). 
Clauses  can  be  parts  of  other  clauses  (as  In  "Join  the  Navy  to 
see  the  worl d.") .  and  can  be  used  as  parts  of  groups  In  many 
different  ways  (for  example#  in  the  MG  "the  man  who  came  to 
dinner"  or  the  PREPG  "by  leaving  the  country'*. )  This  phenomenon 
is  called  ranksh 1  ft.  and  Is  one  of  the  basic  principles  of 
systemic  grammar. 

If  the  units  can  appear  anywhere  In  the  tree#  what  Is  the 
advantage  of  grouping  constituents  Into  "units"  Instead  of 
having  a  detailed  structure  like  the  one  shown  In  our  first 
parsing  tree?  The  answer  Is  In  the  "features"  we  were  noting 
above.  Each  unit  has  associated  with  It  a  set  of  features# 
which  are  of  primary  significance  In  conveying  meaning.  We 
mentioned  that  a  clause  could  have  features  such  as  IMPERATIVE# 
DECLARATIVE#  QUESTION#  ACTV#  PASV#  YES-NO#  and  WH-.  These  are 
not  unrelated  observations  we  can  make  about  a  clause.  They  are 
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related  by  a  definite  logical  structure.  The  choice  between 
YES-MO  and  WH-  Is  meaningless  unless  the  clause  Is  a  0,UESTI0M# 
but  If  It  Is  a  QUESTION,  the  choice  must  be  made.  Similarly, 
the  choice  between  QUESTION,  IMPERATIVE,  and  DECLARATIVE  Is 
mandatory  for  a  MAJOR  clause  (one  which  could  stand  alone  as  a 
sentence),  but  Is  not  possible  for  a  "secondary"  (SEC)  clause, 
such  as  "the  country  wh Ich  possesses  the  bomb."  The  choice 
between  PASV  (as  In  "the  ball  was  attended  by  John",)  and  ACTV 
(as  In  "John  attended  the  ball.")  Is  on  a  totally  different 
dimension,  since  It  can  be  made  regardless  of  which  of  these 
other  features  are  present. 

We  can  represent  these  logical  relationships  graphically 
using  a  few  simple  conventions.  A  set  of  mutually  exclusive 
features  (such  as  QUESTION,  DECLARATIVE,  and  IMPERATIVE)  Is 
called  a  system,  and  Is  represented  by  connecting  the  features 
with  3  vertical  bar; 

QUESTION 

DECLARATIVE 

IMPERATIVE 

The  vertical  order  Is  not  Important,  since  a  system  Is  a 
set  of  unordered  features  among  which  we  will  choose  one.  Each 
system  has  an  entry  cond 1 1 1  on  which  must  be  satisfied  In  order 
for  the  choice  to  be  meaningful.  This  entry  condition  can  be  an 
arbitrary  boolean  condition  on  the  presence  of  other  features. 
The  simplest  case  (and  most  common)  Is  the  presence  of  a  single 
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other  feature.  For  example/  the  system  just  depicted  has  the 
feature  MAJOR  as  Its  entry  condl 1 1  on, ,  s I  nee  only  MAJOR  clauses 
make  the  choice  between  DECIARATIVE/  IMPERATIVE/  and  QUESTION. 

This  simple  entry  condition  Is  represented  by  a  horizontal 
line/  with  the  condition  on  the  left  of  the  system  being 
entered.  We  can  diagram  some  of  our  CLAUSE  features  as; 

DECLARATIVE 

MAJOR -  IMPERATIVE 

SEC  Iquestiom - 


CLAUSE 


H 


YES-MO 

v;h- 


Often  there  are  Independent  systems  of  choices  sharing  t!.e 
same  entry  condition.  For  examplo^  the  choice  between  SEC  and 
MAJOR  and  the  choice  between  PASV  and  ACTV  both  depend  directly 
on  the  presence  of  CLAUSE.  This  tyoe  of  relationship  will  be 

I 

Indicated  by  a  bracket  in  place  of  a  vertical  bar. 


CLAUSE 


If  we  want  to  assign  a  name  to  a  system  (to  .talk  about  It)/ 
we  can  put  the  name  above  the  line  leading  Into  It: 


VOICE 


PASV 

ACTV 


We  can  look  at  these  notations  as  representing  the  logical 
operations  of  "or"  and  "and"/  and  we  can  use  them  to  represent 
more  complex  entry  conditions.  if  the  choice  between  the 
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features  C  and  D  depends  on  the  presence  of  either  A  £Lt  8/ 
draw; 


A 

B 


D 

C 


and  I  f 
presence 


the  entry  condition  for  the 
of  both  A  and  B/  we  write; 


"C-0" 


system 


Is 


the 


Finally,  we  can  allow  "unmarked"  features.  In  cases  where 
the  choice  Is  between  the  presence  or  absence  of  something  of 
Interest.  We  might  have  a  system  like; 


NEGATIVITY 


NEGATIVE 


In  which  the  feature  "non-negative"  Is  not  given  a  name,  but  Is 
assumed  unless  the  feature  NEGATIVE  Is  present. 

We  will  explain  our  grammar  by  presenting  the  system 
networks  for  all  three  ranks  --  CLAUSE,  GROUP, and  WORD,  and 
giving  examples  of  sentences  exhibiting  the  features.  We  have 
not  attempted  to  show  all  of  the  logical  relationships  In  the 
networks  --  our  networks  may  Indicate  combinations  of  features 
which  are  actually  not  possible,  and  would  need  a  more  complex 
network  to  represent  properly.  We  have  chosen  clarity  over 
completeness  whenever  there  was  a  conflict.  In  addition,  we 
have  represented  "features"  of  units  (I.e.  descriptions  of  their 
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structure)  and  ’’functions"  (descriptions  of  their  use)  In  the 
same  network.  In  a  more  theoretical  presentat I  on#  It  would  be 
preferable  to  distinguish  the  two.  The  names  chosen  for 
features  were  arbitrary  mnemonics  Invented  as  they  were  needed/ 
and  are  neither  as  clear  nor  as  systematic  as  they  might  be  In  a 
"cleaned  up"  version. 
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Ills  gLAg.S£ 

The  structure  exhibiting  the  greatest  variety  In  English  Is 
the  CLAUSE.  It  can  express  relationships  and  events  Involving 
time,  place,  manner,  and  other  modifiers.  Its  structure 
indicates  what  parts  of  the  sentence  the  speaker  wants  to 
emphasize,  and  can  express  various  kinds  of  focus  of  attention 
and  emotion.  It  determines  the  purpose  of  an  utterance  -- 
whether  It  Is  a  question,  command,  or  statement  --  and  Is  the 
basic  unit  which  can  stand  alone.  Other  units  can  occur  by 
themselves  when  their  purpose  Is  understood,  as  In  answer  to  a 
question,  but  the  clause  Is  the  primary  unit  of  discourse. 

The  CLAUSE  has  several  main  Ingredients  and  a  number  of 
optional  ones.  Except  for  special  types  of  Incomplete  clauses, 
there  Is  always  a  verb  group,  containing  the  verb,  which 
Indicates  the  basic  event  or  relationship  being  expressed  by  the 
CLAUSE.  Almost  every  CLAUSE  contains  a  subject,  except  for 
IMPERATIVE  (In  which  the  semantic  subject  Is  understood  to  be 
the  person  being  addressed),  and  embedded  clauses  In  which  the 
subject  lies  somewhere  else  In  the  syntactic  structure.  In 
addition  to  the  subject,  a  CLAUSE  may  have  various  kinds  of 
objects,  which  will  be  explained  In  detail  later.  It  can  take 
many  types  of  modifiers  (CLAUSES,  GROUPS,  and  WORDS)  which 
Indicate  time,  place,  manner,  causality,  and  a  variety  of  other 
aspects  of  meaning.  One  part  of  the  CLAUSE  system  network  Is 
shown  In  Figure  26. 
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CLAUSE 


J- 

MAJOR - IDE 

Iqu 


IPERATIVE 
DECLARATIVE 
QUESTION 


I  DANGLING 


H 


YES-NO 


SEC« 


ADJUNCT 


RSQ- 


H 

EN 

TO 

IMG 

WHRS* 


BOUND 

TO 

ING 


r 


RSNG- 


V.'HRS 

ING 

TO 

REPORT 

SUBJ 


PREP*- 


_ jSHORT 


ADJ* 
ADVMEAS* 


NG*- 


_ ^  SUB ING 

_ jSUBTO 


■  THAT 
j  ITSUBa 


OBJ - 

I  — 

RELDEL 
UPREL 


OBJl 

OBJ2 

LOBJ 

PREPOBJ 


SUB* 


0BJ*H 


HSUBJ* 
SUBJT* 
OBJl* 
OBJ2* 
LOBJ* 
TRANS2TO* 


HMEAS* 

TIME* 

DOWN* 


Figure  26 


NETWORK  1 
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Beginning  at  the  top  of  the  network/  we  see  a  choice 
between  MAJOR  (a  clause  which  could  stand  alone  as  a  sentence) 
and  "secondary"  (SEC).  A  MAJOR  clause  Is  either  an  IMPERATIVE 
(a  command),  a  DECLARATIVE,  or  a  QUESTION.  Questions  are  either 
YES-NO  —  answerable  by  "yes"  or  "no",  as  In; 

(si)  Did  you  like  the  show? 

or  WH-  (Involving  a  question  element  like  "when",  "where", 
"which",  "how",  etc.).  The  choice  of  the  WH-  feature  leads  Into 
a  whole  network  of  further  choices,  which  are  shared  by  QUESTION 
and  two  kinds  of  SECondary  clauses  we  will  discuss  later.  In 
order  to  share  the  network,  we  have  used  a  simple  notatlonal 
trick  --  the  symbols  contain  a  and  when  they  are  being 

applied  to  a  question,  we  replace  the  *  with  "Q",  while  when 
they  are  applied  to  relative  clauses,  we  use  "REL".  For 
example,  the  feature  "PREP*"  In  the  network  will  be  referred  to 
as  PREPQ  when  we  find  It  In  a  question,  but  PREPREL  when  It  Is 
In  a  relative  clause.  This  Is  due  to  the  way  the  grammar 
evolved,  and  In  later  versions  we  will  probably  use  only  one 
name  for  these  features.  This  complex  of  features  Is  basically 
the  choice  of  what  element  of  the  sentence  Is  being  questioned. 
English  allows  us  to  use  almost  any  part  of  a  clause  as  a 
request  for  Information.  For  example.  In  a  PREPQ,  a 
prepositional  group  In  the  clause  Is  used,  as  In: 

(s2)  With  what  did  you  erase  It? 

We  more  commonly  find  the  preposition  In  a  DANGLING  position,  as 
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In: 

(s3)  What  did  you  erase  It  wi th? 

We  can  tell  by  tracing  back  through  Network  1  that  sentence  s3 
has  the  features  PREPQ,  DANGLING.  WH-,  QUESTION,  and  MAJOR. 

We  can  use  a  special  question  adverb  to  ask  questions  of 

time,  place,  and  manner,  as  In: 

<s4)  Why  did  the  chicken  cross  the  road? 

(s5)  When  were  you  born? 

(s6)  How  will  you  tell  her  the  news? 

(s7)  Where  has  my  little  dog  gone? 

These  are  all  marked  by  the  feature  ADJQ.  In  discourse  they  can 

also  appear  In  a  short  form  (SHORT)  In  which  the  entire 

utterance  Is  a  single  word,  as  In: 

<s8)  V-Ihv? 

We  can  use  the  word  "how**  In  connection  with  a  measure  adverb 
(like  "fast”)  to  ask  an  ADVMEASQ,  like: 

(s9)  How  fast  can  he  run  the  mile? 

The  most  flexible  type  of  WH-  question  uses  an  entire  noun 
group  as  the  question  element,  using  a  special  pronoun  (like 
"what"  or  "who")  or  a  determiner  (like  "which",  or  "how  many") 
to  Indicate  that  It  is  the  question  element.  These  clauses  have 
the  feature  NGC.,  and  they  can  be  further  divided  according  to 
the  function  of  the  NG  In  the  clause.  It  can  have  any  of  the 
possible  NG  functions  (these  will  be  described  more  formally 
with  regard  to  the  next  network).  For  example.  It  can  be  the 
subject,  giving  a  SUBJ(),  like: 

(slO)  Which  hand  holds  ths  M  and  M's? 
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It  can  be  the  subject  of  a  THERE  clause  (see  below)^  giving  us  a 
SUBJTQ; 

(sll)  How  many  Puerto  R I  cans  are  there  In  Boston? 

A  complement  Is  the  second  half  of  an  "Is”  clause/  like; 
(sl2)  Her  hair  Is  red . 
and  It  can  be  used  to  form  a  COMPQ: 

(sl3)  What  color  was  her  hair? 
or  with  a  "measure"  In  a  MEASQ: 

(slk)  How  deep  Is  the  ocean? 

The  noun  group  can  be  an  object/  leading  to  the  feature 
OBJQ/  as  In; 

(sl5)  VJhat  do  you  want?  or 

(sl6)  Who  did  you  give  the  book? 

These  are  both  06J1Q/  since  the  first  has  only  one  object 
("what")/  and  the  second  ^uest)ons  the  first/  rather  than  the 
second  object  ("who"/  Instead  of  "the  book").  We  use  the 
ordering  of  the  DECLARATIVE  form  "You  gave  aa  the  book".  If 
this  were  reversed/  we  would  have  an  06J2Q/  like: 

(sl7)  What  did  you  give  him? 

If  we  use  the  word  "to"  to  express  the  first  object  with  a 

two  object  verb  like  "give"/  we  can  get  a  TRANST02Q/  like: 

(sl8)  To  whom  did  you  give  the  book?  or 

(sl9)  Who  did  you  give  the  book  to? 

Sometimes  a  NG  can  be  used  to  Indicate  the  time  In  a 
clause/  giving  us  a  TIMEQ; 

(;^20)  What  dav  will  the  Icsman  come? 
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In  a  more  complex  style#  we  can  embed  the  question  element 

within  an  embedded  clause#  such  as: 

(s21)  Wh I ch  ca r  did  your  brother  say  that 

he  was  expecting  us  to  tell  Jane  to  buy? 

The  NG  "which  car"  Is  the  question  element#  but  Is  In  fact 
the  object  of  the  clause  "Jane  to  buy..."#  which  Is  embedded 
several  layers  deep.  This  kind  of  NGQ  Is  called  DOWNCl.  The 
role  of  the  question  element  In  the  embedded  clause  can  Include 
any  of  those  which  we  have  been  describing.  For  example  It 
could  be  the  obj'ect  of  a  preposition#  as  In 

(s22)  VJhat  state  did  you  say  Lincoln  was  born  I n? 

Looking  at  the  network  for  the  features  of  SECondary 

clauses#  we  see  three  main  types  —  ADJUNCT#  "Rank-Shifted 

Qualifier"  (RSQ)#  and  "Rank-Shifted  to  act  as  a  Noun  Group" 

(RSNG).  ADJUNCT  clauses  are  used  as  modifiers  to  other 

clauses#  giving  time  references#  causal  relationships#  and  other 

similar  Information.  We  can  use  a  BOUND  clause  containing  a 

"binder"  such  as  "before"#  "while"#  "because"#  "If"#  "so"# 

"unless"#  etc,#  as  In: 

(s23)  While  Nero  fiddled.  Rome  burned. 

(s24)  if  JLt  ra  I  ns#  stay  home. 

(s25)  Is  the  sky  blue  because  I t  I s  col d? 

To  express  manner  and  purpose#  we  use  a  TO  clause  or  an  ING 
clause: 

(s25)  He  died  is  save  us  from  our  sins. 

(s27)  The  bridge  was  built  us Ing  orlnl tive  tool s . 


The  RSQ  clause  Is  a  constituent  of  a  NG#  following  the  noun 
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In  the  "qualifier"  position  (see  Section  2.3.5  for  a  description 

of  the  positions  In  a  NG) .  It  Is  one  of  the  most  commonly  used 

secondary  clauses,  and  can  be  of  four  different  types.  Three  of 

them  are  classified  by  the  form  of  the  verb  group  within  the 

clause  --  TO,  ING,  and  EN  (where  w>e  use  "en"  to  represent  a  past 

participle,  such  as  "broken"): 

(s28)  the  man  see  about  a.  i ob 
(s29)  the  piece  hoi  ding  the  door  on 
(s30)  a  face  weathered  by  sun  gpd  v<l nd 

Notice  that  the  noun  being  modified  can  have  various  roles 
In  the  clause.  In  examples  28  and  29,  "piece"  Is  the  subject  of 
"hold",  while  "man"  Is  the  object  of  "see".  We  could  have  said: 
(s31)  the  man  the  Job 

In  which  "man"  Is  the  subject  of  "do".  Our  semantic  analysis 
sorts  out  these  possibilities  In  determining  the  meaning  of  a 
secondary  clause. 

The  fourth  type  of  RSQ  clause  Is  related  to  WH-  questions, 
and  Is  called  a  WHRS.  It  uses  a  wh-  element  like  "which"  or 
"what",  or  a  word  like  "that"  to  relate  the  clause  to  the  noun 
It  Is  modifying.  The  different  ways  It  can  use  this  "relating" 
element  are  very  similar  to  the  different  possibilities  for  a 
question  element  In  a  WH-  question,  and  In  fact  the  two  share 
part  of  the  network.  Here  we  use  the  letters  REL  to  Indicate  we 
are  talking  about  a  relative  clause,  so  the  feature  PREP*  In 
Network  1  becomes  PREPREL.  In  sentences  (s2)  through  (s22),  we 
Illustrated  the  different  types  of  WH-  questions.  We  can  show 
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parallel  sentences  for  WHRS  RSQ  clauses.  The  following  list 

shows  som»  examples  and  the  relevant  feature  names: 

(s32)  the  thing  wl th  which  you  erased  JLt  PREPREL 

(s33)  the  thing  that  vou  erased  1 t  wl th  PREPREL  DANGLING 

(s34)  the  reason  whv  the  ch I cken  crossed  the  road  RELADJ 

(s35)  the  day  when  vou  were  born  RELADJ 

(s36)  the  way  W£  will  tel  I  her  the  news  RELADJ 

(s37)  the  Place  I  I tt I e  dog  has  gone  RELADJ 

(s38)  the  reason  ^  RELADJ  SHORTREL 

Cs39)  the  hand  wh I ch  rocks  the  crad1 e  SUBJREL 

(s40)  the  number  of  Puerto  Ricans  there  are  Jji  Boston 

SUBJTREL 

(s41)  the  color  her  ha  I r  was  last  week  COMPREL 
(s42)  the  depth  the  ocean  will  be  MEASREL 
(s43)  the  Information  that  vou  want  OBJIREL 
(s44)  the  man  vou  gave  the  book  OBJIREL 
(s45)  the  book  wh I ch  vou  gave  h  Im  0BJ2REL 
(s46)  the  man  to  whom  vou  gave  the  book  TRANST02REL 
(s47)  the  man  you  gave  the  book  to  TRANST02REL 
(s48)  the  day  the  I ceman  came  TIMEREL 
(s49)  the  car  vour  brother  saljJ  Ml 

tel  I  Jane  to  buv  DOWNREL 

(s50)  the  state  vou  sa I d  Lincoln  was  born  Jji  DOWNREL 

Notice  that  In  sentences  36,  37,  40,  41,  42,  44,  47,  43, 

49,  and  50,  there  Is  no  relative  word  like  "which”  or  "that". 
These  could  just  as  well  all  have  been  put  In,  but  English  gives 
us  the  option  of  omitting  them.  When  they  are  absent,  the 
CLAUSE  Is  marked  with  the  feature  RELDEL. 

Returning  to  our  network,  we  see  that  there  Is  one  other 

type  of  basic  clause,  the  RSNQ.  This  Is  a  clause  v:hlch  Is  rank* 

shifted  to  serve  as  a  NG.  It  can  function  as  a  part  of  another 

clause,  a  preposition  group,  or  an  adjective  group.  There  are 

four  basic  types.  The  first  two  are  TO  and  ING,  as  In: 

(s51)  1  1  Ike  ifl  ILz.  TO 

(s52)  Building  houses  Is  hard  work.  ING 

(s53)  He  got  It  by  saving  coupons.  ING 
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Notice  that  In  s51,  the  RSNG  clause  Is  the  object  (OBJl)/  In 

s52  It  Is  the  subject  (SUBJ)/  and  In  s53  It  Is  the  object  of  a 

preposition  (PREPOBJ).  We  can  have  a  separate  subject  within 

the  TO  and  ING  clauses,  giving  us  the  features  SUBTO  and  SUBING: 

(s5U)  I  wanted  Ruth  to  1 ead  the  revol ut I  on .  SUBTO 
(s55)  They  liked  John's  leading  It.  SUBING 

The  SUBING  form  takes.  Its  subject  In  the  possessive. 

In  addition  to  ING  and  TO,  we  have’ the  REPORT  CLAUSE,  which 

has  the  structure  of  an  entire  sentence,  and  Is  used  as  a 

participant  In  a  relation  about  things  like  hearing,  knowing, 

and  saying: 

(s56)  She  heard  that  the  other  team  had  won... 

(s57)  That  she  wasn*  t  there  surprised  us. 

(s58)  I  knew  Jt£,  coul  d  do  1 1. 

The  word  "that"  Is  used  In  .s56  and  s57  to  mark  the  beginning 
of  the  REPORT  CLAUSE,  so  they  are  assigned  the  feature  THAT. 

The  absence  of  "that"  Is  left  unmarked. 

If  the  subject  of  a  clause  Is  In  turn  a  RSNG  clause,  we  may 
have  trouble  understanding  It: 

(s59)  That  anyone  who  knew  the  combination  could  h.^YS 
opened  the  1 ock  was  obvious. 

There  Is  a  special  mechanism  for  rearranging  the  sentence  by 
using  the  word  "It",  so  that  the  complicated  subject  comes  last: 
(s60)  It  was  obvious  that  anyone  who  knew  the  combination 
coul d  have  opened  the  lock. 


In  this  case,  we  say  that  the  RSNG  clause  Is  serving  as  an 
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ITSUBJ.  TO  and  I NG  clauses  can  do  the  same: 

{s61)  It  wl!l  be  fun  JLfi  see  them  aea  I  n . 

(s62)  It  was  dangerous  gol ng  up  wi thout  ^  pa rachute . 

The  final  type  of  RSNG  Is  the  WHRS,  which  Is  almost 
Identical  to  the  WHRS  RSQ  described  above.  Rather  than  go 
through  the  details  again#  we  will  Indicate  how  a  few  of  our  RSQ 
examples  (sentences  s32  to  sSO)  can  be  converted#  and  will  leave 
the  reader  to  do  the  rest. 

(s63)  I  don't  know  what  h£  ^  It  with.  PREPREL  DANGLING 

(s54)  Ask  him  when  he  was  born.  RELADJ 

(s65)  He  told  me  whv.  RELADJ  SHORTREL 

(s66)  It  Is  amazing  how  many  Puerto  Ricans  there  are  Jji 

Boston.  SUBJTREL 

(s67)  Only  her  hairdresser  knows  wha  t  color  her  ha  I r  was . 


COMPREL 


etc. 

Let  us  examine  one  case  more  carefully: 

(s68)  I  knew  which  car  your  brother  sa I d  that  he 
wss.  expecting  ut.  te  1 1  Jane  buy,. 

Here  we  have  a  DOWNREL  clause#  "which  car.... buy"#  serving 
as  the  object  of  the  CLAUSE  "I  knew...".  However#  this  means 
that  somewhere  below#  there  must  be  another  clause  with  a  slot 
Into  which  the  relative  element  can  fit.  In  th’s  case#  It  Is 
the  RSNG  TO  clause  "Jane  to  buy"#  which  Is  missing  Its  object. 
This  clause  then  has  the  feature  UPREL#  which  Indicates  that  Its 
missing  constituent  Is  somewhere  above  In  the  structure.  More 
specifically  It  Is  08J1UPREL. 

Once  this  connection  Is  found#  the  program  might  change  the 
pointers  In  the  structure  to  place  the  relative  as  the  actual 
OBJl  of  the  embedded  clause  structure.  In  the  current  grammar. 
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the  pointers  are  left  untouched^  and  special  commands  to  the 
moving  functlo^i  •  are  used  when  the  object  Is  referenced  by  the 
semantic  program. 
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2>3.4  Transitivity  in  lini  nifl.US,g 

In  addition  to  the  systems  we  have  already  described,  there 
Is  a  TRANSITIVITY  system  for  the  CLAUSE,  which  describes  the 
number  and  nature  of  its  basic  constituents.  We  mentioned 
earlier  that  a  CLAUSE  had  such  components  as  a  subject  and 
various  objects.  The  transitivity  system  specifies  these 
exactly.  We  have  adopted  a  very  surface-or lented  notion  of 
transitivity.  In  which  we  note  the  number  and  basic  nature  of 
the  objects,  but  do  not  deal  with  their  semantic  roles,  such  as 
"range**  or  **benef  I  c  I  a  ry** .  Halllday's  analysis  <Hal1lday  1967> 

Is  somewhat  different,  as  It  Includes  aspects  which  we  prefer  to 
handle  as  part  of  the  semantic  analysis.  Our  simplified  network 
Is: 
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CLAUSE  has  a  SUBJect  and  a  COMPIement.  The  COMPlement  can  be 
either  a  MG#  as  In  s70  or; 

(s71)  He  was  ^  agent  cjl  the  FB I  . 
or  a  PREPG: 

(s72)  The  king  was  JLn  the  count  I  ng  house.. 
or  an  ADJG: 

(s73)  Her  strength  was  fantast Ic. 

(s74)  My  daddy  Is  stronger  than  vours . 

Other  clauses  are  divided  according  to  the  number  and  type 
of  objects  they  have.  A  CLAUSE  with  no  objects  Is  Intransitive 
( ITRNS) : 

(s75)  He  Is  running. 

With  one  object  It  Is  transitive  (TRANS): 

(s76)  He  runs  a.  milling  machine.. 

With  two  objects  TRANS2; 

(s77)  I  gave  nui  I ove  ^  cherry. 

Some  verbs  are  of  a  special  type  which  use  a  location  as  a 
second  object.  One  example  is  "put"#  as  In: 

(s78)  Put  the  block  ao,  the  table. 

Note  that  this  cannot  be  considered  a  TRANS  with  a 
modifier#  as  In: 

(s79)  He  runs  a  milling  machine  Jji  Chicago. 
since  the  verb  "put"  demands  that  the  location  be  given.  We 
cannot  say  "Put  the  block."  This  type  of  CLAUSE  Is  called 
TRANSL#  and  the  location  object  Is  the  LOBJ.  The  LOBJ  can  be  a 
PREPG  as  In  s78#  or  a  special  adverb#  such  as  "there"  or 
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"somewhere"/  as  In: 

(s80)  Where  did  you  put  It?  cr 
(s81)  Put  Ft  there. 

Some  Intransitive  verbs  also  need  a  locational  object  for 
certain  meanings/  such  as; 

(s82)  The  block  Is  sitting  afl.  the  tabl e . 

This  Is  called  ITRNSL. 

Finally/  there  are  INTenslve  clauses  which  are  not  BE 

clauses/  but  which  have  a  COMPlement/  as  In: 

(s83)  He  felt  sick.  and 

<s84)  He  made  me  sick. 

We  have  not  run  Into  these  with  our  simple  subject  matter/  and  a 
further  analysis  will  be  needed  to  handle  them  properly. 

Any  of  the  constituents  we  have  been  mentioning  can  be 
modified  or  deleted  when  these  features  Interact  with  the 
features  described  in  Network  1.  For  example  In: 

(s85)  the  block  which  I  told  vou  to  out  2Ji  the  table 
the  underlined  CLAUSE  Is  TRANSL/  but  Its  OBJl  Is  missing  since 
It  Is  an  UPREL. 

English  has  a  way  of  making  up  new  words  by  combining  a 
verb  and  a  "particle"  (PRT)/  producing  a  combination  like  "pick 
up"/  "turn  on"/  "set  off"/  or  "drop  out".  These  do  not  simply 
combine  the  meanings  of  the  verb  and  particle/  but  there  Is  a 
special  meaning  attached  to  the  pair/  which  may  be  very 
different  from  either  word  In  Isolation.  Our  dictionary 
contains  a  table  of  such  pal.*S/  and  the  grammar  programs  use 
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them.  A  CLAUSE  whose  verb  Is  a  part  of  PRT  pair  has  the  feature 
PRT.  The  particle  can  appear  either  Immediately  after  the  word: 
(s86)  He  threw  away  the  plan. 

or  In  a  displaced  position  (marked  by  the  feature  DPRT): 

(s87)  He  threw  the  plans  a wav. 

Regardless  of  whether  there  Is  a  PRT  or  not,  we  have  the 
choice  between  the  features  passive  (PASV)  and  active  (ACTV). 
aCTV  places  the  semantic  subject  first: 

(s88)  The  Pres  I  dent  started  the  war. 
while  PASV  puts  the  semantic  object  first: 

(s89)  The  war  was  started  by  the  President. 

If  there  Is  a  PREPG  beginning  with  "by".  It  Is  Interpreted  as 
the  semantic  subject  (as  In  s89},  and  the  CLAU^ E  has  the  feature 
AGENT. 

If  the  CLAUSE  Is  active  and  1 15^  subejet  Is  a  RSNG  CLAUSE, 
we  can  use  the  IT  form  described  ear’Ie'*.  This  Is  marked  by  the 
feature  IT,  and  Its  subject  Is  marked  ITSUBJ,  as  In  sentences 
60,  61,  and  62. 
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2.3.5  Noun  grotiP? 

The  best  way  to  explain  the  syntax  of  the  NOUN  GROUP  Is  to 
look  at  the  "slot  and  filler"  analysis/  which  describes  the 
different  components  It  can  have.  Some  types  of  NG/  such  as 
those  with  pronouns  and  proper  nouns,  will  not  have  this  same 
construction,  and  they  will  be  explained  separately  later. 

We  will  diagram  the  typcial  NG  structure,  using  a  to 

indicate  that  the  same  element  can  occur  more  than  once.  Most 
of  these  "slots"  are  optional,  and  may  or  may  not  be  filled  In 
any  particular  NG.  The  msanings  of  the  different  symbols  are 
explained  below.. 


I  !  I  I  i  I  i 

DET  OHO  NUM  ADw*  CLASP*  NOUN  Q* 
Figure  28  --  NG  Structure 


The  most  important  Ingredient  Is  the  NOUN,  which  is  almost 
always  present  (If  It  Isn't,  the  NG  Is  INCOMplete).  It  gives 
the  basic  Information  about  the  object  or  objects  being  referred 
to  by  the  NG.  Immediately  preceding  the  NOUN,  there  are  an 
arbitrary  number  of  "classifiers"  (CLASP).  Examples  of  CLASP 
are; 

(s90)  Plant  life 

(s91)  walfir  meter  cover  adjustment  screw 
Notice  that  the  same  class  of  words  can  serve  as  CLASP  and  NOUN 
--  in  fact  Halllday  uses  one  word  class  (called  NOUN),  and 
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distinguishes  between  the  functions  of  "head"  and  "classifier". 
We  have  separated  the  two  because  our  dictionary  gives  the 
meaning  of  words  according  to  their  word  class^  and  nouns  often 
have  a  special  meaning  when  used  as  a  CLASP. 

Preceding  the  CLASFs  we  have  adjectives  (AOJ)#  such  as  "big 
beautiful  soft  red..."  We  can  distinguish  adjectives  from 
classifiers  by  the  fact  that  adjectives  can  be  used  as  the 
complement  of  a  BE  CLAUSE/  but  classifiers  cannot.  We  can  say 
"red  hair"/  or  "horse  hair"/  or  "That  hair  is  red."/  but  we 
cannot  say  "That  hair  is  horse."/  since  "horse"  is  a  CLASP/  not 
an  AOJ.  Adjectives  can  also  take  on  the  COMPARatlve  and 
superlative  forms  ("red/  redder/  and  reddest")/  while 
classifiers  cannot  ("horse/  horser/  and  horsest" 1 1 ?) . 

Immediately  following  the  NOUN  we  can  have  various 
qualifiers  (())/  whit!;  can  be  a  PREPG: 

(s92)  the  man  In  the  moon 
or  an  AOJG: 

(s93)  a  night  darker  than  doom 
or  a  CLAUSE  RSQ: 

(s94)  the  woman  who  conducts  the  orchestra 
We  have  already  discussed  the  many  types  of  RSQ  clauses,  in 
later  sections  we  will  discuss  the  PREPG  and  AOJG  types  which 
can  occur  as  qualifiers. 

Finally/  the  first  few  elements  In  the  NG  work  together  to 
give  Its  logical  description  --  whether  it  refers  to  a  single 
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object/  a  class  of  objects/  a  group  of  objects/  etc.  The 
determiner  (DET)  Is  the  normal  start  for  a  NG,  and  can  be  a  word 
such  as  "a"/  or  "that'/  or  a  possessive.  It  Is  followed  by  an 
"ordinal"  (ORD).  There  Is  an  Infinite  sequence  of  number 
ordinals  ("first/  second/  third,..")  and  a  few  others  such  as 
"last"  and  "next".  These  can  be  recognized  since  they  are  the 
only  words  that  can  appear  between  a  DET  like  "the"  and  a 
number/  as  In: 

(s95)  the  next  three  days 

Finally  there  Is  a  NUMber.  It  can  either  be  a  simple 

Integer  like  "one"/  "two"/  etc,  or  a  more  complex  construction 

such  as  "at  least  three"/  or  "more  than  a  thousand".  It  Is 

possible  for  a  NG  to  have  all  of  Its  slots  filled/  as  In: 

DET  ORD  NUM  ADJ  ADJ  CLASF  CLASF  NOUN 
the  first  three  old  red  city  fire  hydrants 

Q(PREPG)  Q(CLAUSE) 

without  covers  you  can  find 

It  Is  Is  also  possible  to  have  combinations  of  almost  any 
subset.  With  these  basic  components  In  mind/  let  us  look  at  the 
system  network  for  NG  In  Figure  29, 
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I  PRONG 


_ j  QUEST 


TPRONG 

PROPNG 


INDEX 


SUBJ 


|SUB> 


H0BJ2 

IQNTFR* 

OFOBJ 


DEM 

DEF -  POSES 


SUBJT  liNDEF 


COMP 


TIME 


PREPCBJ 


DEFPOSS 


HNS 
NPL 

* 

NFS 


HDEF 

... 


_ jNUM 

_ I  QUEST 


HNEG 


Figure  29  —NETWORK  3 


First  we  can  look  at  the  major  types  of  NG.  A  NG  made  up 
of  a  pronoun  Is  called  a  PRONG.  It  can  be  either  a  QUESTton, 
like  "who”  or  ’’what*'/  or  a  non-questf on  (the  unmarked  case)  like 
"I",  "them",  "It",  etc.  The  feature  TPRONG  marks  a  NG  whose 
head  Is  a  special  TPRON,  like  "something",  "everything", 
"anything",  etc.  These  enter  Into  a  peculiar  construction 


I  —  I  o 
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containing  only  the  head  and  qualifiers,  and  In  which  an 
adjective  can  follow  the  head,  as  In; 

(s95)  anything  green  which  Is  bigger  than  the  moon 

The  feature  PROPNG  marks  an  NG  made  up  of  proper  nouns, 
such  as  "John",  "Oklahoma",  or  "The  Union  Of  Soviet  Socialist 
Republ 1 cs . " 

These  three  special  classes  of  NG  do  not  have  the  structure 

described  above.  The  PRONG  Is  a  single  PRONoun,  the  PROPNG  Is  a 

string  of  PROPNs,  and  the  TPRONG  has  Its  own  special  syntax. 

The  rest  of  the  NGs  are  the  unmarked  (normal)  type.  They  could 

be  classified  according  to  exactly  which  constituents  are 

present,  but  In  doing  so  we  must  be  aware  of  our  basic  goals  In 

systemic  grammar.  We  could  note  whether  or  not  a  NG  contained  a 

CLASP  or  not,  but  this  would  be  of  minor  significance.  On  the 

other  hand,  we  do  note,  for  example,  whether  It  has  a  DET,  and 

what  type  of  DET  It  has,  since  this  Is  of  key  Importance  In  the 

meaning  of  the  NG  and  the  way  It  relates  to  other  units.  We 

distinguish  betwen  those  with  a  determlnar  (marked  DET)  and 

those  without  one  (NDET),  as  In; 

(s97)  Cats  adore  fish.  NDET 

(s98)  The  cat  adored  ^  fish.  DET 

The  DET  can  be  DEFInIte  (like  "the"  or  "that"),  INDEFInIte 

(like  "a"  or  "an"),  or  a  quantifier  (QNTFR)  (like  "some", 

"every",  or  "no").  The  DEFInIte  determiners  can  be  either 

DEMonst ra t I ve  ("this",  "that",  etc.)  or  the  word  "the"  (the 

unmarked  case),  or  a  POSSesslve  NG.  The  NG  "the  farmer's  son" 
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has  the  NG  "the  farmer"  as  Its  determiner,  and  has  the  feature 
POSES  to  Indicate  this. 

An  INDEF  NG  can  have  a  number  as  a  determiner,  such  as: 

(s99)  five  gold  rings 
(slOO)  ^  1  east  ^  dozen  eggs 

In  which  case  It  has  the  feature  NUMDET,  or  It  can  use  an  INDEF 
determiner,  such  as  "a".  In  either  case  It  has  the  choice  of 
being  a  QUESTIon.  The  cjuestlon  form  of  a  NUMDET  Is  "how  many", 
while  for  ether  cases  It  Is  "which"  or  "what". 

Finally,  an  NG  can  be  determined  by  a  quantifier  (QNTFR), 
Although  quantifiers  could  be  subclass  I  fled  along  various  lines, 
we  do  so  In  the  semantics  rather  than  the  syntax.  The  only 
classifications  used  syntactically  are  between  singular  and 
plural  (sae  below),  and  between  NEGatlve  and  non-negative. 

If  a  NG  Is  either  NUMO  or  QNTFR,  It  can  be  of  a  special 

type  marked  OF,  like: 

(slOl)  three  the  offices 
(sl02)  all  iif,  your  dreams 

An  OF  NG  has  a  DETermlner,  followed  by  "of",  followed  by  a 
OEFInIte  NG. 

A  determined  NG  can  also  choose  to  be  INCOMpIcte,  leaving 

out  the  NOUN,  as  an 

(sl03)  Give  me  three. 

{sl04)  I  want  none . 

Notice  that  there  Is  a  correspondence  between  the  cases  which 
can  take  the  feature  OF,  and  those  which  can  be  INCOM.  We 
cannot  say  either  "the  of  them"  or  "Give  me  the.". 


Possess  I ves 
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are  an  exception  (we  can  say  ''Give  me  Juan's."  but  not  "Juan's 
of  them"),  and  are  handled  separately  (see  below). 

The  middle  part  of  Network  3  describes  the  different 
possible  functions  a  NG  can  serve.  In  describing  the  CLAUSE,  we 
described  the  use  of  an  NG  as  a  SUBJ,  COMP,  and  OBJects  of 
various  types.  In  addition.  It  can  serve  as  the  object  of  a 
PREPG  (PREPOBJ),  In: 

(rl05)  the  race  of  the  lock 

If  It  Is  the  object  of  "of"  In  one  of  our  special  OF  NGs,  It  Is 

called  an  OFOBJ: 

(sl06)  none  of  vour  tricks 

A  NG  can  also  be  used  to  Indicate  TIME,  as  In: 

(sl07)  Yesterday  the  world  ended. 

(sl08)  The  day  she  left,  all  work  stopped. 

Finally,  a  NG  can  be  the  POSSesslve  determiner  for  another 
NG.  In: 

(sl09)  the  cook's  kettles 

the  NG  "the  cook"  has  the  feature  POSS,  Indicating  that  It  is 
the  determiner  for  the  NG  "the  cook's  kettle",  which  has  the 
feature  POSES. 

When  a  PRONG  Is  used  as  a  POSS,  It  must  use  a  special 

possessive  pronoun,  like  "my",  "your",  etc.  We  can  use  a  POSS 

In  an  Incomplete  NG,  like 

(sllO)  Show  me  vours . 

(sill)  John ' s  Is  covered  with  mud. 

There  Is  a  special  class  of  pronouns  used  In  these  NG's 
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(labelled  DEFPOSS)/  such  as  '‘ycL:rs''/  "trine”/  etc. 

Continuing  to  the  last  parf  of  Network  3/  we  see  fev:tures 
of  oerfon  and  number.  These  aro  useo  to  match  the  noun  to  the 
verb  (If  the  NG  Is  the  subject)  and  the  determiner/  to  avoid 
ungrammatical  combinations  like  "these  kanga-oo"  or  "the  wome-. 
wins".  In  the  case  of  a  PRONG/  there  are  special  pronouns  for 
first/  second/  and  third  person/  singular  and  plural.  The 
feature  NFS  occurs  only  with  the  flrst-percon  singular  p'*onouns 
("I"/  "me"/  "my",  "mine"),  and  no  distinction  Is  made  between 
other  persons,  since  they  have  no  effect  on  the  parsing.  All 
singular  pronouns  or  other  singula;*  NGs  are  marked  with  the 
feature  NS.  The  pronoun  "you"  Is  always  treated  as  If  It  were 
plural  and  no  distinction  Is  made  between  "we",  "you",  "they", 
or  any  plural  (NPL)  NG  as  far  as  the  grammar  Is  concerned.  Of 
course  there  Is  a  semantic  difference,  which  will  be  considered 
In  later  chapters. 


Section  2.3.6  -  Page  128 


2 . 3 . G  Preoos I t i on  Groups 

The  PREPG  is  a  comparatively  simple  structure  used  to 

express  a  rela  'onshlp.  It  consists  of  a  PREPosItlon  followed 

by  an  object  (PREPOBJ),  which  Is  either  a  NG  or  a  RSMG  CLAUSE. 

In  some  cases^  the  preposition  consists  of  a  two  or  three  word 

combination  Instead  of  a  single  word,  as  -n; 

(sll2)  next  to  the  table 
(sll3)  on  top  nt  the  house 

The  grammar  Includes  provision  for  this,  and  the  dictionary 
lists  the  possible  combinations  and  their  meanings.  The  words 
In  such  a  comblr'Jtlon  are  marked  as  PREP2.  The  network  for  the 
PREPG  .s  In  Figure  30. 

COMP 
LOBJ 
ADJUNCT 
AGENT 
aUEST 
RELPREPG 
UPREL 
UPQUEST 

I  Figure  30  —  NETWORK  4 

The  PREPG  can  serve  as  a  constituent  of  a  CLAUSE  In  several 
ways.  It  can  be  a  COMPlement: 

(sll4)  Is  It  In  the  kitchen? 
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a  Vocational  object  (LOBJ): 

(sllS)  Put  It  on  the  table. 
an  ADJUNCT; 

(sll6)  He  got  It  se  1  1  I  nt;  h  I  s  sou  1  . 

or  an  AGENT; 

(sll7)  It  was  bought  ^  the  dev  1 1 . 

If  the  PREPG  Is  a  constituent  of  a  QUESTION  CLAUSC/  It  can  be 

the  question  element  by  having  a  QUEST  NG  as  Its  object: 

(sll8)  In  what  cl  tv 
(sll9)  for  how  many  days 
( sl20)  by  whom 

In  which  case  the  PREPG  Is  also  marked  QUEST.  A  PREPPEL  CLAUSE 
contains  a  RELPREPG; 

(sl21)  the  place  In  wh I ch  she  works 

If  the  CLAUSE  Is  an  .PQUEST  or  an  UPREL,  the  PREPG  can  be 
the  constituent  which  Is  "missing"  the  piece  which  provides  the 
upward  refe.'ence.  In  this  case  It  Is  also  marked  UPREL: 

(sl22}  the  lady  I  saw  you  wl th 
or  UPQUEST; 

(sl23}  Who  did  you  knit  It  for? 

In  these  cases^  It  Is  also  marked  SHORT  to  Indicate  that  the 
object  Is  not  explicitly  In  the  PREPG.  It  can  r.lso  be  short  If 
It  Is  a  PREPG  In  a  DANGLING  PREPQ  or  PREPREL  CLaIjoE; 

(sl24)  what  do  you  keep  It  I n? 

Within  a  NG,  a  PREPG  serves  as  a  qualifier  (Q): 

(sl25>  the  man  Jn  the  I ron  mask 
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or  as  the  body  of  an  OF  M6: 

(sl26)  some  the  oeonl  e 


I 

I 
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Adjective  Groups 

The  ADJG  ts  a  specialized  unit  serving  as  a  COMPlement  of 
an  Intensive  clause/  as  a  Qualifier  to  an  NG,  or  as  a  CLAUSE 
ADJUNCT.  The  network  Is: 


Q - 

THAN 

ADJG - 

COMP— < 

AS 

ADJUNCT - 1 

COMPAR 

QUEST 

Figure  31  -- 

NETWORK 

5 

An  AOJG  which  ^jrves  as  an  ADJUNCT  contains  an  adverb/  like 
"fast"  In: 

(sl27)  He  could  run  faster  than  an  arrow. 

In  place  of  an  adjective.  (Clearly  our  terminology  could  do 
with  some  cleaning  up  at  places  like  this  In  doing  a  theoretical 
version  of  the  grammar.)  The  other  two  types  of  ADJG  use  an 
adjective/  as  In  a  Qualifier: 

(sl28}  a  hotel  bad  as  the  other  one 
or  a  COMPlement: 

(sl29}  They  were  bl I ssful . 

The  basic  forms  for  an  AOJG  Include  THAN: 

(sl30)  holler  than  thou 
AS: 
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(sl31)  quick  ^  a  flash 

COMPARat I ve: 

(sl32)  This  one  Is  b I gger . 
or  QUESTlon: 

(sl33)  How  we  1 1  can  he  take  dictation? 

The  network  Is  arranged  to  show  that  a  qualifier  ADJG  can 
be  only  of  the  first  two  forms  --  we  cannot  say  "a  man  bigger" 
without  using  "than"/  or  say  "a  man  big".  In  the  special  case 
of  a  TPRON  such  as  "anything"  as  In: 

(sl34)  anything  strange 

the  word  "strange"  Is  considered  an  ADJ  which  Is  a  direct 
constituent  of  the  NG/  rather  than  an  ADJG. 

The  grammar  does  not  yet  account  for  more  complex  uses  of 


the  word  "than" 
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2.3.8  Verb  Groups 

The  English  verb  group  Is  designed  to  convey  a  complex 
combination  of  tenses  so  that  an  event  can  relate  several  time 
references.  For  example,  vie  might  have: 

(sl35)  By  next  week  you  will  have  been  1  I v  I  ne 
here  for  a  month. 

This  Is  said  to  have  the  tense  "present  In  past  In  future".  Its 
basic  reference  Is  to  the  future  --  "next  week",  but  It  refers 
back  to  the  past  from  that  time,  and  also  Indicates  that  the 
event  Is  still  going  on.  This  type  of  recursive  tense  structure 
has  been  analyzed  by  Halllday  <Halliday  1966b>  and  our  grammar 
adopts  a  variant  of  his  scheme. 

Essentially  the  choice  Is  between  four  tenses,  PAST, 
PRESENT,  FUTURE,  and  MODAL.  Once  a  choice  between  these  has 
been  made,  a  second,  third,  fourth,  and  even  fifth  choice  can  ba 
made  recursively.  The  combination  of  tenses  is  realized  In  the 
syntax  by  a  sequence  of  the  auxllllary  verbs  "be",  "have",  and 
"going  to",  along  with  the  ING,  EN,  and  IMFInitIve  forms  of  the 
verbs.  The  restrictions  on  the  recursion  are: 

1.  PRESENT  can  occur  only  at  the  outer  ends  of  the  series 
(at  first  and/or  final  choice). 

2.  Except  In  the  final  two  positions,  the  same  tense 
cannot  be  selected  twice  consecutively. 

3.  Future  can  occur  only  once  other  than  In  last 
pos I t Ion. 

4.  Modal  can  be  only  In  final  position. 


frrr 


It  Is  Important  to  distinguish  between  the  position  of  a 
word  in  the  VG  and  the  position  of  Its  tesnse  In  the  recursive 
tense  feature  --  the  direction  Is  reversed.  In  sl35,  "will'*  Is 
the  first  word,  and  "living"  the  last,  while  the  tense  Is 
PRESENT  In  PAST  In  FUTURE.  Some  sample  verb  groups  and  their 
tenses  are:  (from  <HallIday  1966b>) 


ACTIVE 

took  -  past 
takes  -  present 
will  take  -  future 
can  take  -  modal 
has  taken  -  past  In  present 
v/as  taking  -  present  In  past 
was  going  to  have  taken  -  past  In  future  In  past 
was  going  to  have  been  taking  -  present  In  past  In  future  In  past 

PASSIVE 

Is  taken  -  present 

could  have  been  taken  -  past  In  modal 
has  been  going  to  have  been  taken  - 

past  In  future  In  past  In  present 

Figure  32  --  Verb  Group  Tenses 


The  structure  of  a  finite  V6  (one  taking  part  In  this  tense 
system  —  see  below  for  other  types)  Is  a  sequence  of  verbs  and 
auxllllaries  In  which  the  last  Is  the  "main  verb”  (marked  MVB 
and  remembered  by  the  parser),  and  the  first  Is  either  a  MODAL, 
the  word  "will",  or  a  "finite"  verb  (one  carrying  tense  and 
number  agreement  with  the  subject).  Interspersed  In  the 
sequence  there  may  be  adverbs,  or  the  word  "not"  (or  Its  reduced 
form  "n't").  The  best  way  to  describe  the  relationship  between 
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the  sequence  of  verbs  and  the  tense  Is  by  giving  a  flow  chart 
foi  parsing  a  VG.  This  Is  a  good  example  the  usefulness  uf 
representing  syntax  In  the  form  of  procedures,  as  It  describes  a 
relatively  complex  system  In  a  clear  and  succinct  way. 

In  the  flow  chart  (Figure  33)  the  variable  T  represents  the 
tense,  and  the  symbol  Indicates  the  addition  o*  a  member  to 

the  front  of  a  list.  The  Indicates  replacement  In  the 

FORTRAN  sense,  and  the  function  ’’REMOVE"  removes  words  from  the 
Input  string.  The  features  used  are  those  described  for  verbs 
In  section  2.3.9.  The  command  (FQ  PASV)  Indicates  that  the 
entire  VG  Is  to  be  marked  with  the  feature  PASV  (passive  voice). 
The  flow  chart  does  not  Indicate  the  ent 1  re  pars  I ng,  but  only 
that  part  relevant  to  determining  the  tense. 


1 


i 


1 
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EN'^ER 


y 


Figure  33  --  Syntax  of  VG  Tense  Structure 
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This  system  of  tenses  Is  operative  only  for  FINITE  verb 
groups.  The  network  for  the  VG  In  general  Is: 


FINITE 

C— 

IMPER 

EN  . 

ITOOEL 

TO . .  I 

ING 

VG - 

H 

BE 

jPASV 

Iactv 

1 

NEG 

Iri 

— 

Figure  34 

--NETWORK  6 ’ 

There  are  several  types  of  VG  which  do  not  enter  the  normal 

tense  system,  but  which  have  a  specialized  form.  The  IMPER  VG 

Is  used  In  Imperatives; 

(sl36)  FI  re  when  ready. 

(sl37}  Don ' t  drop  the  baby. 

It  consists  of  a  verb  In  the  INFlnltlve  form,  possibly  preceded 
by  the  auxi  Illary  '*do"  or  Its  negative  form  '•don't**.  The  EN  VG 
Is  used  In  EN  RSQ  CLAUSES,  like: 

(sl38)  a  man  forsaken  by  his  friends 
and  consists  of  a  past  partlcple  verb.  The  ING  VG  Is  made  up  of 
an  ING  verb  or  the  verb  "being"  followed  by  an  EN  verb.  It  Is 
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used  In  various  types  of  ING  clauses: 

(sl39)  Being  married  Is  great. 

(slUO)  the  girl  sitting  near  the  wall 

Similarly,  the  TO  VG  Is  used  In  TO  clauses.  In  the  case  of 
conjoined  structures,  the  "to"  may  be  omitted  from  the  second 
clause,  as  In: 

(sl41}  We  wanted  to  stop  the  war  and  end  repression. 

Such  a  VG  I s  marked  TODEL. 

We  separate  those  verb  groups  whose  main  verb  Is  "be"  from 
the  others,  as  they  do  not  undergo  the  further  choice  between 
PASV  and  ACTV.  These  correspond  to  the  same  features  for 
clauses,  and  are  seen  In  the  structure  by  the  fact  that  a  PASV 
VG  contains  a  form  of  the  auxllllary  "be"  followed  by  the  main 
verb  In  the  EN  form,  as  In: 

(sl42)  The  paper  was  finished  by  the  deadline. 

<sl43)  He  wanted  ^  kissed  by  the  bride. 

Finally,  any  VG  can  be  NEGatIve,  either  by  using  a  negative 
form  of  an  auxllllary  like  "don't",  "hasn't",  or  "won't",  or  by 
Including  the  word  "not". 
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■?->.?  1.9  Words 

Our  grammar  uses  a  number  of  separate  word  classes/  each  of 
which  can  be  divided  Into  subclasses  by  the  features  assigned  to 
individual  v;ords.  It  was  necessary  to  make  arbitrary  decisions 
as  to  whether  a  distinction  between  groups  of  words  should  be 
represented  by  different  classes  or  different  features  within 
the  same  class.  Actually  we  could  have  a  much  more  tree- like 
structure  of  word  classes/  In  which  the  Ideas  of  classes  and 
features  were  combined.  Since  this  has  not  yet  been  done/  we 
will  present  a  list  of  the  different  classes  In  alphabetical 
order/  and  for  each  of  them  give  descriptions  of  the  relevant 
features.  Many  words  can  be  used  In  more  than  one  clasS/  and 
some  classes  overlap  to  a  large  degree  (such  as  NOUN  and  CLASP). 
In  our  dictionary/  we  simply  list  all  of  the  syntactic  features 
the  word  has  for  all  of  the  classes  to  which  It  can  belong. 

When  the  parser  parses  a  word  as  a  member  of  a  certain  class/  It 
sorts  out  those  features  which  are  applicable.  Figure  35  Is  a 
list  of  the  word  classes  and  their  features. 

AOJ  --  Adjective  Is  one  of  the  constituents  of  a  NG  as  well  as 
being  the  main  part  of  an  ADJG.  This  class  Includes  words 
like  ''big"/  "ready"/  and  "strange".  The  only  features  are 
superlative  (as  In  "biggest")  and  COMPARative  (as  In 
"bigger"). 

ADV  —  We  use  the  name  "adverb"  to  refer  to  a  whole  group  of 

words  used  to  modify  other  words  or  clauses.  It  Is  sort  of  a 
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CLASS 

ADV 

BINDER 
CLASP 
r  ET 

AOJ 

NOUN 

NUM 

NUMD 

ORD 

PREP 

PREP2 

PRON 

PRONREL 

PROPN 

PRT 

QADJ 

TPRON 

VB 


FEATURES 

ADV  ADVADV  LOBJ  PLACE  PREPADV  TIMW  TIM2  VBAD 

BINDER 

CLASP 

DEF  DEM  DET  INCOM  INDEF  NEG  NONUM  NPL  NS  OFD  PART 
QDET  QNTFR 

ADJ  COMPAR  SUP 

MASS  NOUN  NPL  NS  POSS  TIME  TIMl 
NPL  NS  NUM 

NUMD  NUMDALONE  NUMDAN  NUMDAT 
ORD  TIMORD 
PLACE  PREP  NEED2 
PREP2 

DEFPOSS  NEG  NFS  NPL  NS  OBJ  POSS  PRON  REL  SUBJ 
NPL  NS  PRONREL 
NPL  NS  POSS  PROPN 
PRT 

PLACE  QADJ 
NEG  NPL  NS  TPRON 

AUX  BE  DO  EN  HAVE  IMPERF  INF  ING  INGOB  INGOB2  I  NT 
ITRNS  ITRNSL  MODAL  MVB  NEG  PAST  PRES  OUAX  REPOB  REPOB2 
SUBTOB  SUBTOB2  TOOB  TOOB2  T02  TRANS  TRANSL  TRANSL2  TRANS2 
VB  VFS  VPL  VPRT  V3PS  WILL 


Figure  35  --  VJord  Classes  and  Applicable  Features 
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"mixed  bag"  of  words  which  don't  really  fit  anywhere  else. 

The  basic  classification  depends  on  what  Is  being  modified# 

and  has  the  terms  (AOVADV  VBAO  PREPADV  CLAUSEADV).  An  ADVADV 

Is  a  word  like  "very"  v/hlch  modifies  other  adverbs  and 

adjectives.  A  VBAD  modifies  verbs#  and  Includes  the  class 

of  words  ending  In  "-1y"  like  "quickly"  and  "easily''.  A  I 

PREPADV  modifies  prepositions#  as  "directly"  In  "directly  I 

above  the  stove".  A  CLAUSEADV  Is  a  constituent  of  a  clause# 

and  can  be  either  TIMW  or  PLACE.  A  HMW  like  "usually"# 

"never"#  "then"#  or  "often"  appears  as  a  CLAUSE  constituent 
specifying  the  time.  The  PLACE  ADV  "there"  can  either  be  an 
adjunct,  as  In: 

(sl44)  There  I  sav/  a  miracle, 
or  an  LOBJ#  as  In: 

(sl4")  Put  It  there. 

BINDER  --  Binders  are  used  to  "bind"  a  secondary  clause  to  a 

major  clause#  as  In: 

(sl46)  Before  you  got  there#  we  left. 

{sl47)  I'll  go  JLf  you  do. 

We  do  not  assign  any  other  features  to  binders. 

CLASP  --  In  Section  2.3.5  we  discussed  the  use  of  CLASP  as  a 

constituent  of  a  NG.  The  CLASP  Is  often  another  NOUN#  but  It 
appears  In  a  position  like  an  adjective#  as  In  "bov  scout". 

DET  --  DETermlners  are  used  as  a  constituent  of  a  NG#  as 
described  In  2.3.5.  They  can  have  a  number  of  different 
features#  as  described  In  the  network: 
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1  MHC  C 

QDFT 

f  t  r 

IDEM 

r_ 

I- 

JOFO~SNCOM 

DET - 

QNTFR - 

1  — 

_ INEG 

1  — 

INONUM 

NPL 

i- 

■■■  ”1 

NS 

MASS 

Figure  36  —  NETWORK  7 

A  DET  can  be  INDEFinIte/  like  ''a’'  or  "an"  or  the 
question  determiners  (QDET)  "which"/  "what"/  and  "how  many". 
It  can  be  DEFInItO/  like  "the"  or  the  OEMonstrat I ve 
determiners  "this"/  "that"/  "those"/  and  "these".  Or  It  can 
be  a  quantifier  (QNTFR)  like  "any"/  "every"/  "some"/  etc. 
Quantifiers  can  have  the  feature  OFD/  Indicating  that  they 
can  be  used  In  an  OF  NG  like: 

(sl48)  soma  of  my  best  friends 
We  originally  had  a  separate  feature  named  INCOM  Indicating 
whether  they  could  be  used  In  an  Incomplete  NG  like: 

(sl49)  Buy  some. 

but  later  analysis  showed  these  features  were  the  same.  Not 
all  quantifiers  are  OFD  *-  we  cannot  say  "every  of  the  cats" 
or  "Buy  every."  Quantifiers  can  also  be  NEGatlvC/  like  "none" 
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or  "no"#  or  can  be  NONUM#  Indicating  that  they  cannot  be  used 
with  a  number#  such  as  "many"  or  "none"  (we  can  say  "any  three 
cats"  or  "no  three  cats"#  but  not  "none  three"  or  "many  three") . 
The  NG  program  takes  these  features  Into  account;  In  deciding 
what  NG  constituents  to  look  for.  It  also  has  to  find  agreement 
In  number  between  the  DET  and  the  NOUN.  A  DET  can  have  the 
features  "singular"  (NS)#  "plural"  (NPL)  or  MASS  (like  "some"  or 
"no"#  which  can  go  with  MASS  nouns  like  "water").  A  DET  can 
have  more  than  one  of  these  --  "the"  has  all  three#  while  "all" 
Is  MASS  and  NPL#  and  "a"  Is  just  NS.. 

NOUN  —  The  main  constituent  of  a  NG  Is  Its  NOUN.  I t  has  a 
feature  of  number#  Identical  to  that  of  the  DETermlners  It  must 
match.  The  word  "parsnip"  Is  NS#  "parsnips"  Is  NPL#  and 
"wheat"  Is  MASS.  Some  nouns  may  have  more  than  one  of  these# 
such  as  "fish"#  which  Is  all  three  since  It  can  be  used  In  "a. 
fish"#  "three  fish"#  or  "Fish  Is  my  favorite  food."  In 
addition#  a  NOUN  can  be  POSSessIve#  like  "parsnip's". 

In  order  to  tell  whether  a  NG  Is  functioning  as  a  time 
element  In  a  CLAUSE#  we  need  to  know  whether  Its  NOUN  can  refer 
to  time.  We  therefore  have  two  features  —  TIME  words  like 
"day"#  and  "month"#  as  In: 

(slSO)  The  next  dav  It  started  to  snow, 
and  TIMl  words  like  "yesterday"  and  "tomorrow".  This 
Illustrates  the  Interaction  between  syntax  and  semantics.  A 
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phrase  like  "the  next  visit”  can  be  used  to  Indicate  a  time, 
since  a  "visit”  Is  an  event.  The  actual  distinction  should 
be  the  semantic  difference  between  "event”  and  "non-event" 

The  grammar  could  be  easily  changed  to  look  at  the 
semantic  features  rather  than  syntactic  features  of  the  NOUN 
In  deciding  whether  It  could  be  the  head  of  a  TIME  NG. 

NUM  --  The  class  of  NUMbers  Is  large  (uncountably  Infinite)  but 
not  very  Interesting  syn tact  I  cal  1 y.  For  our  purposes  we  only 
note  the  features  NS  (for  "one")  and  NPL  (for  all  the  rest). 
In  fact/  our  system  does  not  accept  numbers  In  numeric  form, 
and  h..3  only  been  taught  to  count  to  ten, 

NUMD  --  In  complex  number  specifications,  like  "at  least  three” 
or  "more  than  a  million",  there  Is  a  NUMD.  The  features  they 
can  have  are  (NUMDAN  NUMDAS  NUMDAT  NUMOALONE).  NUMDAN  words 
such  as  "more"  and  "fewer"  are  used  with  "than”,  while  NUMDAS 
words  such  as  "few"  fit  Into  the  frame  "as... as",  and  NUMDAT s 
are  preceded  by  "at",  as  In  "at  least",  and  "at  most", 
NUMDALOME  Indicates  that  the  NUMD  can  stand  alone  with  the 
number,  and  Includes  "exactly"  and  "approximately". 

ORD  --  The  class  of  ORDlnals  Includes  the  ordinal  numbers 

"first",  "second",  etc.,  and  a  few  other  words  which  can  fit 
Into  the  position  between  a  determiner  and  a  number,  like 
"next",  "last",  and  "only".  Notice  that  Superlative 
ADJectlves  can  also  fill  this  slot  In  the  NG. 

PREP  --  Every  PREPG  begins  with  a  PREPosItlon,  either  alone,  or 
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as  part  of  a  combination  such  as  "on  top  of".  In  the 
combination  case#  the  words  following  the  Initial  PREP  liave 
the  feature  PREP2.  A  PREP  which  cannot  appear  without  a 
PREP2  (such  as  "next"  which  appears  In  "next  to")  are  marked 
NEE02. 

PRON  --  PRONouns  can  be  classified  along  a  number  of  dimensions/ 
and  we  can  think  of  a  large  mul tl-dImenslonaJ  table  with  most 
of  Its  positions  filled.  They  have  number  features  (MS  NPL 
NFS)  (note  that  Instead  of  the  more  usual  division  Into 
first/  second/  and  third  person,  singular  and  plural,  we  have 
used  a  reduced  one  In  which  classes  with  the  same  syntactic 
behavior  are  lumped  together).  They  can  be  POSSessIve,  such 
as  "your"  or  "my",  or  POSSOEF,  like  "yours"  or  "mine".  Some 
of  the  personal  pronouns  distinguish  between  a  SUBdect  form 
like  "I"  and  an  OBJect  form  like  "me",  there  are  also 
special  classes  like  DEMonstrat 1 ve  ("this"  and  "that")  and 
PRONREL  --  the  pronouns  used  In  relative  clauses,  such  as 
"who",  "which",  and  "that".  Those  which  can  be  used  as  a 
question  element,  such  as  "which"  and  "who"  are  marked  QUEST. 

PROPN  --  Proper  nouns  Include  single  words  like  "Carol",  or 
phrases  such  as  "The  American  Legion"  which  could  be  parsed, 
but  are  Interpreted  as  representing  a  particular  object 
(physical  or  abstract).  A  PROPN  can  be  NPL  or  NS,  and  Is 
assumed  to  be  NS  unless  defined  otherwise. 

PRT  --  In  Section  2.3.4,  we  discussed  clauses  which  use  a 


1 
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combination  of  a  "particle"  and  a  verb#  like  "pick  up"  or  j 

I 

"knock  out".  The  second  word  of  these  Is  a  PRT.  I 

QAOJ  --  One  class  of  QUESTION  CLAUSE  uses  a  QAOJ  such  as  | 

"where",  "when",  or  "how"  as  Its  question  element.  They  can 

I 

also  be  used  In  various  kinds  of  relative  clauses,  as  j 

explained  In  Section  2.3.3.  | 

i 

TPRON  --  There  Is  a  small  class  of  words  made  up  of  a  quantifier 
and  the  suffix  "-thing"  which  enter  Into  a  special  type  of  NG 
construction  like  "anything  green".  This  Is  not  an 

abbreviation  for  a  quantifier  followed  by  a  noun,  since  the  i 

1 

NG  "any  block  green"  would  have  the  same  structure  but  Is  not 
grammatical. 

VB  —  The  verb  has  tne  most  complex  network  of  features  of  any 
word  In  our  grammar.  They  describe  Its  tense,  transitivity, 
number,  and  use,  as  well  as  marking  special  verbs  like  "be". 

The  network  Is  In  Figure  37.  ' 

! 

Verbs  are  divided  Into  AUXIllIarles  and  others  i 

(unmarked).  AUXIllIarles  are  the  "helping  verbs"  which  j 

combine  with  others  In  complex  VG  structures.  They  can  have  i 

special  NEGatIve  forms,  like  "can't",  or  can  appear  standing 
alone  at  the  beginning  of  a  QUESTION,  In  which  case  they  have 
the  function  QAUX,  as  In:  j 

(sl51)  Will  I  ever  finish?  i 

The  auxllllarles  Include  "be",  "do",  "have",  "will",  and  the 
MODALs  like  "could"  "can",  and  "must".  Separate  features  are 


Figure  37 


NETWORK  8 
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used  for  these  as  they  are  critical  In  determining  the 
struc';ure  of  a  VG.  An  AUX  can  choose  from  the  system  of 
person  and  number,  distinguishing  "third-person  singular" 
(VSPS)  as  In  "Is",  "plural",  as  In  "have",  or  "first 
singular"  (VFS),  used  only  for  "am". 

Non-aux 1 1 1  I  ary  verbs  can  be  VPRT,  which  combine  with  a 
PRT,  and  they  have  a  whole  cluster  of  transitivity  features. 
In  Section  2.3.4  we  described  the  different  transitivity 
features  of  the  CLAUSE,  and  these  are  controlled  by  the  verb. 
We  therefore  have  the  features  (TRANS  ITRMS  TRANS2  TRANSl 
ITRNSL  INT)  Id  addition,  the  verb  can  control  what  types  of 
RSNG  CLAUSE  can  serve  as  Its  various  objects.  The  feature 
names  combine  the  type  of  CLAUSE  (ING  TO  REPORT  SU3T0  SUPING) 
with  either  -OB  or  -0B2,  to  get  a  product  set  of  features 
like  SU6T0B  and  ING0B2. 

For  example,  the  verb  "want"  has  the  features  TOOB  and 
SUBTOB,  but  not  INGOB,  REPOB,  etc.  since  "I  want  to  go."  and 
"I  want  you  to  go."  are  grammatical,  but  "I  want  going.",  "I 
want  that  you  go.",  etc.  are  not. 

Finally,  all  of  these  kinds  of  verbs  can  be  In  various 
forms  such  as  ING  ("breaking"),  EN  ("broken"),  INFInItIve 
("break),  PAST  ("broke"),  and  PRESent  ("breaks").  The 
network  does  not  Illustrate  all  of  the  relations,  as  some 
types  (like  MODAL)  do  not  make  all  of  these  choices. 
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2.3.10  Con ■ unct I  on 

One  of  the  most  complex  parts  of  English  Is  the  system  of 
conjunction.  This  section  presents  a  simplified  version  which 
has  been  Implemented  using  the  special  Interrupt  feature  of 
PROGRAMMAR  (see  Section  2.4.2  for  details).  This  makes  the 
parsing  particularly  simple. 

The  basic  concept  Is  that  anv  un I t  In  a  sentence  can  ce 
replaced  by  a  COMPOUND  unit  of  the  same  type.  In  the  sentence: 

(sl52)  I  baked  ^  chocol a te  cake .  three  pies,  and  seme 
hashish  brown  1 es . 

the  object  Is  a  COMPOUND  NG  with  three  components.  There  can  be 
a  compound  ADJ^  as  In: 

(sl53)  a  red  or  yellow  flag 
or  a  phrase  can  be  ambiguous/  as  In: 

(sl54)  black  cats  and  horses 

This  can  be  Interpreted  as  having  either  a  COMPOUND  NG/ 
composed  of  the  NGs  "black  cats"  and  "horses"/  or  a  single  MG 
with  a  COMPOUND  NOUN/  "cats  and  horses". 

The  features  of  a  COMPOUND  unit  are  determined  by  Its 
components  and  by  the  type  of  conjunction.  The  conjuncti  m 
features  are  from  the  following  network: 
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The  first  choice  Is  the  actual  conjunction  used.  The 
feature  BOTH  Indicates  a  word  at  the  beginning  of  a  COMPOUND 
structure/  as  In: 

(sl55)  both  you  and  your  family 
the  specific  word  depends  on  the  conjunction  —  "both"  with 
"and"/  "either"  with  "or/  and  "neither"  with  "nor".  The 
features  BOTH  and  NOR  combine  In: 

(sl56)  neither  you  nor  I 

A  conjoined  structure  can  be  made  up  pf  two  elements  with  a 
conjunction  (as  In  the  previous  three  examples)/  or  a  LIST 
connected  with  commas  and  a  conjunction  before  the  last  element 
(as  In  sl52)/  or  It  can  be  a  list  connected  with  conjunctions  (a 
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LISTA),  as  In: 

(sl57)  cabbages  and  kings  and  sealing  wax  and  things 

Every  constituent  but  the  first  Is  marked  with  the  feature 
COMPONENT.  The  COMPOUND  unit  also  takes  on  features  from  Its 
constituents.  It  may  have  features  such  as  number  and  tense/ 
relevant  to  Its  syntactic  function.  For  example/  a  COMPOUND  NG 
with  the  feature  AND  must  be  plural  (NPL)/  while  one  with  the 
feature  OR  will  have  the  number  feature  of  Its  Initial 
const  I tuent. 

Finally/  there  can  be  "forked"  structures  like: 

(sl58}  We  dropped  the  watermelon  and  ran. 

Notice  that  this  cannot  be  viewed  In  the  same  way/  since 
"dropped  the  watermelon"  Is  not  a  single  constituent  of  any 
unit.  This  sentence  Is  viewed  as  a  conjunction  of  two  clauses/ 
sharing  the  subject  "we".  The  second  clause  Is  marked  with  the 
feature  SU8JF0RK  to  Indicate  this.  Similarly/  the  subject  and 
verb  can  be  shared/  as  In: 

(sl59}  He  gave  Greg  the  apple  and  Gloria  the  pear. 

The  clause  "Gloria  the  pear"  Is  marked  as  VBFORK  and  SUBJFORK. 
The  only  constituents  relative  to  a  fork  are  those  necessary  for 
the  clause  structure.  In: 

(sl60)  They  easily  saw  the  trouble  and  cheerfully  fixed  It. 
the  parser  does  not  determine  whether  the  "easily"  applies  to 
both  constituents,  but  leaves  this  decision  to  semantics.  This 
Is  In  keeping  with  our  orientation  of  syntax  to  surface 
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structure  and  the  belief  thcit  much  of  "deep  structure"  Is  more 
the  province  of  the  semantic  analyzer. 
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2.3.11  Fol lowing  Uia  Parser  Jn  Operation 

Let  us  follow  the  parser  through  two  examples  to  see  how 
the  grammar  Is  used  In  practice.  We  will  not  actually  watch  all 
of  the  details/  or  deal  with  the  way  semantic  programs  are 
Intermixed  with  the  grammar.  Instead  we  will  follow  a  somewhat 
reduced  version,  to  get  a  feeling  for  the  way  the  grammar  works, 
and  the  way  It  Interacts  with  the  different  features  described 
above.  We  have  chosen  one  very  simple  sentence,  and  another 
which  Is  difficult  enough  to  exercise  some  of  the  more  complex 
features  of  the  grammar.  The  first  sentence  Is  the  first 
sentence  of  our  sample  dialog  (Section  1.3): 

Pick  up  a  big  red  block. 

The  system  begins  trying  to  parse  a  sentence,  which,  as 
explained  above,  means  looking  for  a  MAJOR  CLAUSE.  It  activates 
the  grammar  by  calling  (PARSE  CLAUSE  MAJOR).  Since  CLAUSE  Is 
one  of  our  units,  there  Is  a  program  defined  for  It.  The  CLAUSE 
program  Is  called  with  an  Initial  feature  list  of  (CLAUSE 
MAJOR). 

The  CLAUSE  program  looks  at  the  first  word.  In  order  to 
decide  what  unit  the  CLAUSE  begins  with.  If  It  sees  an  adverb. 
It  assumes  the  sentence  begins  with  a  single-word  modifier.  If 
It  sees  a  PREPosItlon,  It  looks  for  an  Initial  PREPG,  If  It 
sees  a  BINDER,  It  calls  the  CLAUSE  program  to  look  for  a  BOUND 
CLAUSE.  In  English  (and  possibly  all  languages)  the  first  word 
of  a  construction  often  gives  a  very  good  clue  as  to  what  that 
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construction  will  be.  We  have  "advance  notice"  of  what 
structures  to  look  for,  and  this  makes  parsing  much  easier.  Our 
grammar  tries  to  make  as  much  use  as  possible  of  these  natural 
"signals".  In  this  case/  the  Initial  word  Is  a  verb/  and  this 
Indicates  that  we  may  have  an  IMPERATIVE  CLAUSE.  The  program 
calls  (PARSE  VG  IMPER)  to  start  the  VG  program  with  the  Initial 
VG  feature  list  (VG  IMPER)/  looking  for  a  VG  of  the  right  type. 
The  VG  program  checks  this  Initial  feature  list/  and  sees  that 
It  Is  looking  for  an  IMPERatlve  VG.  This  must  either  begin  with 
some  form  of  the  verb  "do"/  or  with  the  main  verb  Itself.  Since 
the  next  word  Is  not  "do"/  It  calls  (PARSE  VB  INF  (MVB)).  This 
is  a  different  kind  of  call  to  PARSE/  since  VB  Is  not  a  unit  we 
have  defined.  It  Is  a  word  clasS/  and  the  call  says  to  check 
the  next  word  In  the  Input  (!n  this  case  still  the  first  word) 
to  see  whether  It  Is  Indeed  the  INFInltlve  form  of  a  VerB,  If 
SO/  It  Is  to  be  attached  to  the  parsing  tree/  and  given  the 
additional  feature  MVB  (main  verb).  The  current  structure  can 
be  d I sgrammcdt 

(CLAUSE  MAJOR)  (pick...) 

(VG  IMPER)*  (pick...) 

(VB  MVB  INF  TRANS  VPRT)  pick 

Figure  39  --  Syntactic  Structure  1 

We  use  several  conventions  for  diagramming  syntactic 
structures.  Rather  than  using  a  tree  format  (which  quickly 
grows  off  of  the  page)/  we  use  a  format  more  like  a  traditional 
outline/  with  the  sub-phrases  of  any  phrase  Indicated 
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Immediately  below  It  and  Indented.  We  use  the  symbol  to 

Indicate  the  program  which  Is  currently  active/  and  show  the 
actual  words  of  each  piece  of  the  sentence  to  the  right  of  the 
outline/  putting  units  larger  than  WORD  In  parentheses.  A 
series  of  dots  In  the  English  words  to  the  right  of  a  unit 
Indicates  that  the  program  for  that  unit  has  not  yet  finished. 

Figure  39  shows  that  we  have  a  CLAUSE/  with  a  constituent 
which  Is  a  V6/  and  that  the  VG  program  Is  active.  ‘  The  VG  so  far 
consists  of  only  a  VB.  Notice  that  some  new  properties  have 
appeared  on  the  list  for  VB.  We  have  not  mentioned  TRANS  or 
VPRT.  These  came  from  the  definition  of  the  word  "pick"  when  we 
called  the  function  PARSE  for  a  word  (see  section  2.4.4  for 
uetal 1 s) . 

Ordinarily  the  VG  program  checks  for  various  kinds  of  tense 
and  number/  but  In  the  special  case  of  an  IMPER  VB/  It  returns 
Immediately  after  finding  the  verb.  We  will  see  other  cases  In 
the  next  example. 

When  the  VG  program  succeeds/  CLAUSE  takes  over  again. 

Since  It  has  found  the  right  kind  of  VG  for  an  IMPERatlve 
CLAUSE/  It  puts  the  feature  IMPER  on  the  CLAUSE  feature  list. 

It  then  checks  to  see  whether  the  MVB  has  the  feature  VPRT/ 
Indicating  It  Is  a  special  kind  of  verb  which  takes  a  particle. 
It  discovers  that  "pick"  Is  such  a  verb/  and  next  checks  to  see 
If  the  next  word  Is  a  PRT,  which  It  Is.  It  then  checks  In  the 
dictionary  to  see  If  the  combination  "pick  up"  Is  defined/  and 
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when  It  discovers  this  Is  true/  It  calls  (PARSE  PRT)  to  add  "up" 
to  the  parsing  tree.  Notice  that  we  might  have  let  the  VG 
program  do  the  work  of  looking  for  a  PRT/  but  It  would  have  run 
Into  difficulties  with  sentences  like  "Pick  the  red  block  up." 

In  which  the  PRT  Is  displaced.  By  letting  the  CLAUSE  program  do 
the  looking/  the  problem  Is  simplified. 

As  soon  as  It  has  parsed  the  PRT/  the  CLAUSE  program  marks 
the  feature  PRT  on  Its  own  feature  list.  It  then  looks  at  the 
dictionary  entry  for  "pick  up"  to  see  what  transitivity  features 
are  there.  It  Is  TRANSltlve/  which  Indicates  that  we  should 
look  for  one  object  --  OBJl.  The  dictionary  entry  does  not 
Indicate  that  this  Is  a  verb  which  can  take  special  types  of 
RSNG  clauses  as  objects/  so  the  object  must  be  either  a  NG  or  a 
WHRS  clause  (which  can  appear  wherever  a  NG  can).  If  the  object 
were  a  WKRS  clause/  It  would  begin  with  a  relative  pronoun/  like 
"Pick  up  what  I  told  you  to."  Since  the  next  word  Is  "a"/  this 
Is  not  the  case/  so  the  CLAUSE  program  looks  for  an  object  by 
calling  (PARSE  NG  OBJ  OBJl)/  asking  the  NG  program  to  find  a  NG 
which  can  serve  as  an  OBJl.  The  structure  Is  new: 


(CLAUSE  MAJOR  IMPER  PRT)  (pick  up...) 

(VG  IMPER  )  (pick) 

(VB  MVB  INF  TRANS  PRT)  pick 

(PRT)  up 

(NG  OBJ  OBJl)*  (...) 


Figure  40  --  Syntactic  Structure  2 


The  NG  program  Is  started  and  notices  that  the  upcoming 
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word  Is  a  DET,  "a".  It  calls  (PARSE  DET)  to  add  It  to  the 
parsing  tree#  then  uses  the  function  TRMSF  to  transfer  relevant 
features  from  the  DET  to  the  entire  NG.  It  Is  Interested  In  the 
type  of  determination  (DEF  vs.  INDEF  vs.  QNTFR)/  and  the  number 
(NS  vs.  NPL) .  It  also  adds  the  feature  DET  to  the  MG  to 
indicate  that  It  has  a  determiner.  The  feature  list  for  the  NG 
Is  now; 

(NG  OBJ  OBJl  DET  INDEF  NS) 

since  "a"  Is  a  singular  Indefinite  determiner.  The  NG  program 
then  notices  the  feature  INDEF,  and  decides  not  to  look  for  a 
number  or  an  ordinal  (we  can't  say  "a  next  three  blocks"),  or 
for  the  OF  construction  ("a  of  them"  Is  Impossible).  It  goes  on 
Immediately  to  look  for  an  ADJectIve  by  calling  (PARSE  ADJ). 

When  this  succeeds  wit*'  the  next  word  "big",  a  simple  program 
loop  returns  to  the  (PARSE  ADJ)  statement,  which  succeeds  again 
with  "red",  on  the  next  trip  It  falls,  and  sends  the  program  on 
to  look  for  a  classifier,  since  "block"  Isn't  an  ADJ.  But 
"block"  Isn't  a  CLASF  either  In  our  dictionary,  so  the  NG 
program  goes  on  to  look  for  a  NOUN,  by  calling  (PARSE  NOUN). 

This  succeeds  with  the  NOUN  "block",  which  Is  singular,  and  the 
program  checks  to  see  If  It  agrees  with  the  number  features 
already  present  from  the  determiner  (to  eliminate  Illegal 
combinations  like  "these  boy").  In  this  case,  both  are  singular 
(NS),  so  the  program  Is  satisfied.  Ordinarily  It  would  go  on  to 
look  for  qualifiers,  but  In  this  case  there  Is  nothing  left  In 
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the  sentence.  Remember  that  we  have  an  especially  (tasy  way  of 
indicating  in  a  PROGRAMMAR  program  what  action  should  be  taken 
at  any  point  if  the  sentence  runs  out.  We  can  do  I t  by  simply 
putting  a  third  direction  In  any  branch  statement.  In  this 
case/  since  we  have  found  all  of  the  basic  constituents  we  need 
for  a  NG/  the  "third  branch"  tells  us  that  the  NG  program  should 
return  success.  If  we  had  run  out  after  the  determiner#  It 
would  have  sent  us  to  check  for  an  iNCOMplete  NG#  while  If  we 
had  run  out  after  an  ADJ  It  would  have  entered  a  backup  program 
which  would  check  to  see  whether  It  had  misinterpreted  a  NOUN  as 
an  ADJ. 

In  this  case#  the  NG  program  returns#  and  the  CLAUSE 
program  similarly  notices  that  the  sentence  has  ended.  Since  a 
TRANS  verb  needs  only  one  object#  and  that  object  has  been 
found#  the  CLAUSE  program  marks  the  feature  TRANS#  and  returns# 
ending  the  parsing.  In  actual  use#  a  semantic  program  would  be 
called  here  to  understand  and  execute  the  command  --  In  fact# 
semantic  programs  would  have  been  called  at  various  points 
throughout  the  process.  The  final  result  looks  likes 
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(CLAUSE  MAJOR  IMPER  PRT  TRANS)  (pick  up  a  big  red  block) 

(VG  IMPER)  (pick) 

(VB  MVB  INF  TRANS  VPRT)  pick 

(PRT)  up 

(NG  OBJ  O&Jl  OET  INDEF  NS)  (a  big  red  block) 

(DET  INDEF  NS)  a 

(ADJ)  big 

(ADJ)  red 

(NOUN  NS)  block. 

Figure  41  --  Syntactic  Structure  3 


I 
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How  let  us  take  a  more  complex  sentence/  like: 

How  many  blocks  are  supported  by  the  cube  n'hlch  I 
wanted  you  to  pick  up? 

We  will  not  go  Into  as  much  detail/  but  will  emphasize  the 
new  features  exhibited  by  this  example.  First/  the  parser 
recognizes  thi't  this  sentence  Is  a  question  by  Its  punctuation. 
It  ends  with  a  question  mark.  This  "cheating"  Is  not  really 
necessary,  and  In  the  future  the  grammar  will  be  revised  to  look 
for  the  other  signals  of  a  question  (for  example,  beginning  with 
a  determiner  like  "how  many"  or  "which"). 

In  any  event,  the  feature  QUESTION  Is  noted,  and  the 
program  must  decide  what  type  of  question  It  Is.  It  checks  to 
see  if  the  CLAUSE  begins  with  a  QADJ  like  "why",  "where",  etc. 
or  with  a  PREPosltlon  which  might  begin  a  PREPG  QUEST  (like  "In 
what  year..."). 

All  of  these  things  fall  In  our  example,  so  It  decides  the 
CLAUSE  must  have  a  NG  as  Its  question  element,  (called  NGQ), 
marks  this  feature,  and  calls  (PARSE  NG  QUEST).  The  NG  program 
starts  out  by  noticing  QUEST  on  Its  Initial  feature  list,  and 
looking  for  a  question  determiner  (DET  QDET) .  Since  there  are 
only  three  of  these  ("wnich",  "what",  and  "how  many"),  the 
program  checks  for  them  explicitly,  parsing  "how"  as  a  QDET,  and 
then  calling  (PARSE  NIL  MANY),  to  add  the  word  "many"  to  the 
parsing  tree,  without  worrying  about  Its  features.  (The  call 
(PARSE  NIL  X)  checks  to  see  If  the  next  word  Is  actually  the 
word  "x") ) . 
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Since  a  determiner  has  been  founds  Its  properties  are  added 
to  the  NG  feature  list/  (In  this  case/  (NUMDET  INREF  NPL))/  and 
the  NG  program  goes  on  with  Its  normal  business/  looking  for 
adjectives/  classifiers/  and  a  noun.  It  finds  only  the  NQUM 
“blocks"  with  the  features  (NOUN  NPL) .  The  word  “block"  appears 
In  the  dictionary  with  the  feature  NS/  but  the  Input  program 
which  recognized  the  plural  ending  changed  NS  to  NPL  for  the 
form  “blocks".  Agreement  Is  checked  between  the  NOUN  and  the 
rest  of  the  NG/  and  since  “how  many"  added  the  feature  NPL/  all 
Is  well.  This  time/  there  Is  more  of  the  sentence  left/  so  the 
NG  program  continues/  looking  for  a  qualifier.  It  checks  to 
see  If  the  next  word  Is  a  PREPosltlon  (as  In  "blocks  aq  the 
table)/  a  relative  word  (“blocks  which.. .).  a  past  participle 
(“blocks  suppoorted  by...)/  an  INP  verb  (“blocks  sitting  on...) 
a  compara.’’lve  adjective  (“blocks  bl gger  than...)  or  the  word 
“as"  (“blocks  aS,  big  as...).  If  any  of  these  are  true/  It  tries 
to  parse  the  appropriate  qualifying  phrase.  If  not/  It  tries  to 
find  an  RSQ  CLAUSE  (“blocks  the  block  supports),  in  this  case/ 
all  of  these  fall  since  the  next  word  Is  “are"/  so  the  NG 
program  decides  It  will  find  no  qualifiers/  and  returns  what  It 
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Next  the  CLAUSE  program  wants  a  VG,  so  It  calls  (PARSE  VG 
NAUX).  The  feature  NAUX  Indicates  that  we  want  a  VG  which  does 
not  consist  of  only  an  AUXllllary  verb#  like  "be"  or  "have".  If 
we  saw  such  a  VG,  It  would  Indicate  a  structure  like  "How  many 
blocks  are  the  boxes  supporting?".  In  which  the  question  MG  Is 
the  object  of  the  CLAUSE.  We  are  Interested  In  first  checking 
for  the  case  where  the  question  MG  Is  the  subject  of  the  CLAUSE. 

The  VG  program  Is  designed  to  deal  with  combinations  of 
auxllllary  verbs  like  "had  been  going  to  be..."  and  notes  that 
the  first  verb  Is  a  form  of  "be".  It  calls  (PARSE  VB  AUX  BE), 
assuming  that  "are"  Is  an  auxllllary  rather  than  the  main  verb 
of  the  sentence  (If  this  turns  out  wrong,  there  Is  backup).  It 
transfers  the  Initial  tense  and  person  features  from  this  verb 
to  the  entire  VG  (The  English  VG  always  uses  the  leading  verb 
for  these  features,  as  In  "He  has  been...",  where  It  Is  "has" 
which  agrees  with  "he")  in  this  cssa  "are"  Is  plural  (VPL)  and 
present  tense  (PRES). 

When  "be"  Is  used  as  an  auxllllary.  It  Is  followed  by  a 
verb  In  either  the  ING  or  the  EN  form.  Since  "supported"  Is  an 
EN  form  (and  was  marked  that  way  by  the  Input  program).  The  VG 
program  calls  (PARSE  VB  EN  (MVB)),  marking  "supported"  as  the 
main  verb  of  the  clause.  The  use  of  a  "be"  followed  by  an  EN 
form  Indicates  a  PASV  VG,  so  the  feature  PASV  Is  marked,  and  the 
VG  program  Is  ready  to  check  agreement.  Notice  that  so  far  we 
haven't  found  a  SUBJect  for  this  clause,  since  the  QUESTIon  NG 
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might  have  been  an  object#  as  In  ”Hqw  many  blocks  does  the  box 
support?”  Hctwever  the  VG  program  Is  aware  of  this#  and  realize.# 
that  Instead  of  checking  agreement  with  the  constituent  marked 
SUnj#  It  must  use  the  one  marked  QUEST.  It  uses  PROGRAMMAR’ s 
pointer-moving  functions  to  find  this  constituent#  and  notes 
that  It  Is  NPL#  which  agrees  with  VPL.  VG  therefore  Is  happy 
and  returns  Its  value.  We  now  have: 


(CLAUSE  MAJOR  QUESTION  NGQ)*  (how  many  blocks  are  supported...) 


(NG  QUEST  DET  NUMDET  NPL  INDEF) 
(DET  QOET  NPL  INDEF) 

() 

(NOUN  NPL) 


(how  many  blocks) 
how 
many 
blocks 


(VG  NAUX  VPL  PASV  (PRES)) 
(VB  AUX  BE  PRES  VPL) 
(VB  MVB  EN  TRANS) 


(are  supported) 
are 

supported 


Figure  43  —  Syntactic  Structure  5 


The  CLAUSE  program  resumes#  and  marks  the  feature  SUBJQ# 
since  It  found  the  right  kind  of  VG  to  Indicate  that  the  NG  "how 
many  blocks”  Is  Indeed  the  subject.  It  next  checks  to  see  If  we 
have  a  PRT  situation  as  we  did  In  our  first  example.  We  don't# 
so  It  next  checks  to  see  If  the  VG  Is  PASV#  and  marks  the  clause 
with  the  feature  PASV.  This  Indicates  that  there  wl’il  be  no 
objects#  but  there  might  be  an  AGENT  phrase.  It  checks  that  the 
next  word  Is  "by”#  and  calls  (PARSE  PREPG  AGENT). 

The  PREPG  program  Is  fairly  simple  —  It  first  calls  (PARSE 
PREP)#  vhen  'PARSE  NG  OBJ  PREPOBJ).  The  word  "by”  Is  a  PREP#  so 
the  first  call  succeeds  and  NG  Is  called  and  operates  as 
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described  before#  finding  the  DET  "the"  and  the  NOUN  "cube"#  and 
checking  the  appropriate  number  features.  In  this  case#  "the" 

Is  both  NPL  and  NS#  while  "cube"  Is  only  NS#  so  after  checking 
the  NG  has  only  the  feature  NS. 

The  NG  program  next  looks  for  qualifiers#  as  described 
above#  and  this  time  It  succeeds.  The  word  "which"  signals  the 
presence  of  a  RSQ  WHRS  CLAUSE  modifying  "cube".  The  NG  program 
therefore  calls  (PARSE  CLAUSE  RSQ  WHRS).  The  parsing  tree  now 


looks  like: 


(CLAUSE  MAJOR  QUESTION  NGQ  SUBJQ  PASV) 

(how  many  blocks  are 

supported  by  the  cube...) 

(NG  QUEST  DET  NUMOET  NPL  INDEF) 

(how  many  blocks) 

(DET  QDET  NPL  INDEF) 

how 

() 

many 

(NOUN  NPL) 

blocks 

(VG  NAUr,  VPL  PASV  (PRES)) 

(are  supported) 

(VB  AUX  BE  PRES  VPL) 

are 

(V3  MVB  EN  TRANS) 

supported 

(PREPG  AGENT) 

(by  the  cube. . . ) 

(PREP) 

by 

(NG  OBJ  PREPOBd  DET  DEF  NS) 

( the  cube. . . ) 

(DET  DEF  NPL  NS) 

the 

(NOUN  NS) 

cube 

(CLAUSE  RSQ  WHRS)* 

(...) 

Figure  44  --  Syntactic 

Structure  6 

The  CLAUSE  program  Is  Immediately  dispatched  by  the  feature 
WHRS  to  look" for  a  RELWD.  It  finds  "which"#  and  marks  Itself  as 
NGREL.  It  then  goes  on  to  look  for  a  (VG  NAUX)  just  as  our 
QUESTION  NGQ  clause  did  above.  Remember  that  WH-  questions  and 
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WHRS  clauses  share  a  great  deal  of  the  network^  and  they  share 
much  of  the  program  as  well.  This  time  the  VG  program  falls/ 
since  the  next  word  Is  ”1'*/  so  the  CLAUSE  program  decides  that 
the  clause  "which  I..."  Is  not  a  SUBJREL.  It  adds  the  temporary 
feature  NSUBREL,  Indicating  this  negative  knowledge#  but  not 
deciding  yet  just  what  we  do  have.  It  then  goes  to  the  point  In 
the  normal  clause  program  which  starts  looking  for  the  major 
constituents  of  the  clause  --  subject#  verb#  etc.  We  call 
(PARSE  NG  SUBJ)  and  succeed  with  the  PRONG  "I".  We  then  look 
for  a  VG#  and  fine’  "wanted".  In  this  case#  since  the  verb  Is 
PAST  tense#  It  doesn't  need  to  agree  with  the  subject  (only  the 
tenses  beginning  with  PRES  show  agreement).  The  feature  NAGR 
marks  the  non-appl I cabi 1 1 ty  of  agreement.  The  parsing  tree  from 
the  WHRS  node  on  down  Is  now: 


|(CLAUSE  RSQ  WHRS  NGREL  NSUBRED*  (which 

1 

1  wanted...) 

(RELWD) 

wh  I  ch 

(NG  SUBJ  PRONG  NFS) 

( 1 ) 

(PRON  NFS) 

1 

(VG  NAGR  (PAST)) 

(wanted) 

(VB  MVB  PAST  TRANS 

TOOBJ  SUBTOBJ) 

wanted 

Figure  45  — 

Syntactic  Structure  7 

The  CLAUSE  program  notes  that  the  MVB  Is  TRANS  and  begins 
to  look  for  an  OBJl.  This  time  It  also  notes  that  the  verb  Is  a 
TOOBJ  and  a  SUBTOBJ  (It  can  take  a  TO  clause  as  an  object#  as  In 
"I  wanted  ifi  go.",  or  a  SUBTO#  as  In  "I  wanted  you  to  go."  Since 
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the  next  word  Isn't  "to"#  It  decides  to  look  for  a  SUBTO  clause# 
calling  (PARSE  CLAUSE  RSNG  OBu  OBJl  SUBTO).  In  fact#  this 
checking  for  different  kinds  of  RSNG  clauses  Is  done  by  a  small 
function  ..amed  PARSEREL#  which  looks  at  the  features  of  tha  MVB# 
and  calls  the  appropriate  clauses.  PARSEREL  Is  used  at  several 
points  In  the  grammar#  and  one  of  main  advantages  of  writing 
grammars  as  programs  Is  that  we  can  write  such  auxllllary 
programs  (whether  In  PROGRAMMAR  or  LISP)  to  make  full  use  of 
regularities  In  the  syntax. 

The  CLAUSE  program  Is  called  recursively  to  look  for  the 
SUBTO  clause  "you  to  pick  up".  ;f  finds  the  subject  "you"#  and 
calls  (PARSE  VG  TO)  since  It  needs  a  verb  group  of  the  "to" 
type.  The  VG  program  notices  this  feature  and  finds  the 
appropriate  VG  (which  Is  again  NAGR).  The  PRT  mechanism 
operates  as  described  In  the  first  example#  and  the  bottom  of 
our  structure  now  looks  like: 
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(CLAUSE  RSQ  WHRS  NGREL  NSUBREL)  (which 

1  wanted  you 

to  pick  up) 

(RELWD) 

wh  I  ch 

(NG  SUBd  PRONG  NFS) 

(1) 

(PRON  NFS) 

1 

(VG  NAGR  (PAST)) 

(wanted) 

(VB  MVB  PAST  TRANS  TOOBJ  SUBTOBJ)  wanted 

(CLAUSE  RSNG  SUBTO  OBJ  OBJl  PRT)* 

(you  to  pick 

up) 

(NG  SUBJ  PRONG  NPL) 

(you) 

(PRON  NPL) 

you 

(VG  TO  NAGR) 

(to  pick) 

() 

to 

(VB  MVB  INF  TRAMS  VPRT) 

pick 

(PRT) 

up 

Figure  46  —  Syntactic  Structure  8 

Notice  that  we  have  a  transitive  verb-particle  combination, 
"pick  up",  with  no  object,  and  no  words  left  In  the  sentence. 
Ordinarily  this  would  cause  the  program  to  start  backtracking  -- 
checking  to  see  If  the  MVB  Is  also  Intransitive,  or  If  there  Is 
some  way  to  reparse  the  clause.  However  we  are  In  the  special 
circumstance  of  an  embedded  clause  which  Is  somewhere  on  the 
parsing  tree  belov;  a  relative  clause  with  an  "unattached" 
relative.  In  the  clause  "which  I  told  you  to  pick  up",*!" Is 
the  subject,  and  the  CLAUSE  "you  to  pick  up"  Is  the  object.  The 
"which"  has  not  been  related  to  anything.  There  Is  a  small 
program  named  UPCHECK  which  uses  PROGRAMMAR' s  ability  to  look 
around  on  the  parsing  tree.  It  looks  for  this  special 
situation,  and  when  It  finds  It  does  three  things;  1)  Mark  the 
current  clause  as  UPREL,  and  the  appropriate  type  of  UPREL  for 
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the  thing  It  Is  missing  (In  this  case  OBJIUPREL) .  2)  Remove 

the  feature  NSUBREL  from  the  clause  with  the  unattached  relative 
3)  Replace  It  v^lth  DOVJNREL  to  indicate  that  the  relative  has 
been  found  below.  This  can  all  be  done  with  simple  programs 
using  the  basic  PROGRAMMAR  primitives  for  moving  around  the 
tree  (see  section  2.4.10}  and  manipulating  features  at  nodes 
(see  2.4.11).  The  Information  which  Is  left  In  the  parsing  tree 
Is  sufficient  for  the  semantic  routines  to  figure  out  the  exact 
relationships  between  the  various  pieces  Involved. 

In  this  example/  once  the  CLAUSE  "to  pick  up"  has  been 
marked  as  OBJIUPREL,  It  has  enough  objects,  and  can  return 
success  since  the  end  of  the  sentence  has  arrived.  The  CLAUSE 
"which  I  want  you  to  pick  up"  has  an  object,  and  has  Its 
relative  pronoun  matched  to  something,  so  It  also  succeeds,  as 
does  the  NG  "the  cube...",  the  PREPG  "by  the  cube..",  and  the 
MAJOR  CLAUSE.  The  final  result  Is  shown  In  Figure  47. 

Even  In  this  fairly  lengthy  description,  we  have  left  out 
much  of  what  was  going  on.  For  example  we  have  not  mentioned 
all  of  the  places  where  the  CLAUSE  program  checked  for  adverbs 
(like  "usually"  or  "quickly"),  or  the  VG  program  looked  for 
"not",  etc.  These  are  all  "quick"  checks,  since  there  Is  a 
PROGRAMMAR  command  which  checks  the  features  of  the  next  word. 

In  following  the  actual  programs,  the  course  of  the  process 
would  be  exactly  as  described,  without  backups  or  other  attempts 
to  parse  major  structures. 
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(CLAUSE  MAJOR  QUESTION  NGQ  SUBJQ  PASV  AGENT) 


(NG  QUEST  DET  NUMDET  NPL  INOEF)  (how  many  blocks) 

(DET  QDET  NPL  INOEF)  how 

( )  many 

(NOUN  NPL)  blocks 

(VG  NAUX  VPL  PASV  (PRES))  (are  supported) 

(VB  AUX  BE  PRES  VPL)  are 

(VB  MVB  EN  TRANS)  supported 

(PREPG  AGENT)  (by  the  cube  which  I  wanted  you  to  pick  up) 

(PREP)  by 

(NG  OBJ  PREPOBJ  DET  OEF  NS) 

(the  cube  which  I  wanted  you  to  pick  up) 

(DET  DEF  NPL  NS)  the 

(NOUN  NS)  cube 

(CLAUSE  RSQ  WHRS  NGREL  DOWNREL  TRANS) 

(which  1  wanted  you  to  pick  up) 

(RELWD)  which 

(NG  SUBJ  PRONG  NFS)  (I) 

(PRON  NFS)  1 

(VG  NAGR  (PAST))  (wanted) 

(VB  MVB  PAST  TRANS  TOOBJ  SUBTOBJ)  wanted 

(CLAUSE  RSNG  SUBTO  OBJ  OBJl  PRT 

TRANS  UPREL  OBJIUPREL)  (you  to  pick  up) 

(NG  SUBJ  PRONG  NPL)  (you) 

(PRON  NPL)  you 

(VG  TO  NAGR)  (to  pick) 

()  to 

(VB  MVB  INF  TRANS  VPRT)  pick 

(PRT)  up 


Figure  47  -  Syntactic  Structure  9 
"How  many  blocks  are  supported  by  the  cube 
which  I  wanted  you  to  pick  up?" 
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This  may  seem  like  a  quite  complex  process  and  complex 
grammar,  compared  to  other  systems,  or  even  our  own  examples  In 
Section  2.2.  This  Is  because  language  Is  Indeed  a  highly 
complex  phenomenon.  We  have  tried  to  handle  a  great  deal  more 
of  the  complexity  of  English  than  any  of  the  previous  language¬ 
understanding  systems.  It  Is  only  due  to  the  fact  that 
PROGRAMMAR  gives  us  an  easy  framework  In  which  to  Include 
complexity  that  It  was  at  all  possible  to  Include  such  a 
detailed  grammar  as  only  one  part  of  a  project  carried  out  by  a 
single  person  In  less  than  two  years. 
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2-.?>l2  Anal  Y.S  U  SiL  Word  Endings 

This  section  describes  the  "spelling  rules’'  used  by  the 
program  In  recognizing  Inflectional  endings  of  words.  For 
spoken  language^  these  would  be  called  the  "morphophonemi c" 
rules#  but  since  we  deal  with  written  language#  they  are 
"morpho-graphemic." 

These  rules  enable  a  reader  to  recognize  that#  for  example# 
"pleasing"  Is  a  form  of  "please"#  while  "beating"  Is  a  form  of 
"beat".  There  Is  a  structure  of  conventions  for  doubling 
consonants#  dropping  "e"#  changing  "I"  to  "y"#  etc.  when  adding 
endings#  and  a  corresponding  set  for  removing  them. 

A  word  like  "running"  need  not  have  separate  entry  In  the 
dictionary#  since  It  Is  a  regular  Inflected  form  of  "run".  The 
program  can  use  an  Interpretive  procedure  to  discover  the 
underlying  form  and  attach  the  appropriate  syntactic  features 
for  the  inflection. 

In  des’gning  a  formalism  for  these  rules#  It  seems  most 
natural  to  express  them  as  a  program  for  Interpretation.  The 
flow  chart  In  Figure  48  Is  designed  to  handle  a  number  of 
Inflectional  endings  --  "-n't"  for  negative#  "-’s"  and  for 

possessive#  "-s"  and  Its  various  forms  for  plural  nouns  and 
singular  third-person  verbs#  "-Ing"#  "-ed"#  and  "-en"  verb 
forms#  the  superlative  "-est"  and  comparative  "-er"#  and  the 
adverbial  "-ly". 

As  the  flowchart  shows#  these  endings  share  many  aspects  of 
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morphographemi c  structure/  and  the  program  representation  Is 
able  to  capture  these  generalities  as  well  as  det:z!  1 1ng  those 
aspects  peculiar  to  each  ending.  It  Is  not  a  complete 
description,  but  covers  a  wide  variety  of  words,  and  could 
easily  be  expanded  to  treat  more  special  cases. 

The  description  uses  a  few  simple  notations  In  addition  to 
normal  flowchart  conventions.  "No"  and  "yes"  answers  to  branch 
conditions  are  represented  by  single  and  double  arrows 
respectively.  The  function  "cutoff"  Indicates  what  Is  to  be 
cut  off  of  the  end  of  the  word.  The  ordinals  "1st",  "2nd",  etc. 
count  letters  from  the  end  of  the  word  backwards.  Ignoring  those 
which  have  been  cut  off.  Several  classes  of  letters  are 
relevant  to  endings  —  VOWEL  Includes  (A  E  1  0  U  Y),  LIQUID  Is 
(L  R  S  V  Z),  and  NOEND  Is  <C  G  S  V  Z).  The  label  "TRY" 
represents  the  part  of  the  program  which  tries  looking  up  the 
supposed  root  In  the  dictionary.  It  uses  whatever  letters  have 
not  been  "cut  off".  At  some  places,  the  program  tries  an 
Interpretation,  then  If  that  falls,  carries  out  a  different 
analysis  and  tries  again. 

The  program  can  correctly  analyze  such  roots  as:  bashes  - 
bash,  bathes  -  bathe,  leaning  -  lean,  leaving  -  leave,  dented  - 
dent,  danced  -  dance,  dogs  -  dog,  kisses  -  kiss,  curved  -  curve, 
curled  -  curl,  rotting  -  rot,  rolling  -  roll,  played  -  play, 
plied  -  ply,  realest  *  real,  palest  -  pale,  knives  -  knife, 
prettily  ~  pretty,  nobly  -  noble,  etc. 
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It  is  Important  to  note  that  exceptions  do  not  need  to  be 
explicitly  Included  In  the  analysis  program.  The  word  "was"  Is 
directly  In  the  dictionary,  and  the  rules  will  tnerefore  never 
be  applied  In  trying  to  analyze  It.  The  distinction  between 
lexical  I d losyncraci es  and  generalities  Is  empirical.  A 
productive  rule  like  the  one  relating  "sang"  and  "sing",  or 
"rang"  and  "ring"  might  well  be  Included  In  a  more  complete 
program,  while  less  productive  ones  (such  as  relating  "will"  to 
"won't")  will  be  left  as  separate  dictionary  entries. 

It  Is  tempting  to  see  this  program  as  a  simple  finite  state 
machine,  but  this  Is  not  the  best  formalism  for  several  reasons. 
First,  the  tests  which  can  be  done  to  a  word  In  deciding  on  a 
transition  are  not.  In  general,  simple  checks  of  the  next  Input 
letter.  Whether  a  certain  analysis  Is  possible  may  depend,  for 
example,  on  how  many  syllables  there  are  In  the  word,  or  on  some 
complex  phonological  calculation  Involving  vowel  shifts. 

Semantic  and  syntactic  Information  should  be  applied  as  well  In 
many  cases. 

The  morphographemi c  analysis  alone  Is  sufficient  to  do  a 
great  deal  of  the  Interpretation  of  English  endings.  In  fact, 
some  systems  (<Thorne  1969>)  use  It  to  avoid  having  a  dictionary 
of  open  class  words.  The  Inflection  of  the  words  In  the  Input 
determines  their  syntactic  class  possibilities.  Of  course, 
without  a  dictionary,  they  cannot  attempt  to  deal  with  the 
meaning  of  words,  beyond  trying  to  match  them  with  other 
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occurences  of  the  same  root. 

People  rarely  operate  at  this  level  of  Ignorance.  They  use 
t:hetr  lexical  knowledge  to  realize  that  "under"  Is  not  a 
comparative  form  of  some  adjective  "und",  and  that  "bely"  Is  not 
an  adverbial  form  of  "be".  This  knowledge  can  I'eadlly  be 
Integrated  Into  the  Interpretive  program.  Once  a  possible 
morpiiographemlc  analysis  Is  found/  the  hypothetical  root  can  be 
cnecked  In  the  dictionary.  If  It  Is  not  there/  the  program  can 
try  another  analysis.  Notice  that  In  the  flow  chart/  words 
ending  In  a  double  LIQUID/  followed  by  an  Inflection/  are  first 
tried  In  the  form  with  the  doubled  consonant  (as  In  "rolling"  - 
"roll")/  then  If  that  fallS/  another  try  Is  made  with  the  single 
consonant  ("patrolled"  -  "patrol"). 

if  the  root  Is  found  In  the  dictloary/  Its  syntactic  class 
Is  checked  to  see  If  the  ending  Is  appropriate  (e.g.  a  "-est" 
ending  can  apply  only  to  an  ADJ).  The  list  of  syntactic 
features  Is  changed  to  take  the  ending  Into  account.  For 
example/  the  ending  "-s"  changes  the  feature  NS  (singular  noun) 
to  NPL  (plural  noun)  and  the  feature  INF  (Infinitive  verb)  to 
V3PS  (third-person  singular  verb).  A  word  which  fits  Into  more 
than  one  category  may  have  more  than  one  set  of  features 
changed/  In  determining  all  of  the  possible  Interpretations. 

The  problems  Involved  In  analyzing  more  complicated 
endIngS/  such  as  the  "-tion"/  "-ment"/  and  "-er"  endings  of 
nominal Izatlons  wl  1 1  be  discussed  In  a  later  paper. 
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2.4  Programming  Details 
2.4.1  Operation  of  the  System 

Since  the  grammar  Is  Itself  a  program,  there  Is  not  much 
overhead  mechanism  needed  for  the  basic  operation  of  the  parser. 
Instead,  the  system  consists  mostly  of  special  functions  to  be 
used  by  the  grammar.  The  system  maintains  a  number  of  global 
variables,  and  keeps  track  of  the  parsing  tree  as  It  Is  built  by 
the  main  function,  PARSE.  When  the  function  PARSE  Is  called  for 
a  UNIT  which  has  been  defined  as  a  PROGRAMMAR  program,  the 
system  collects  Information  about  the  currently  active  node,  and 
saves  I t  on  a  pushdown  list.  It  then  sets  up  the  necessary 
variables  to  establish  a  new  active  node,  and  passes  control  to 
the  PROGRAMMAR  program  for  the  appropriate  unit.  If  this 
program  succeeds,  the  system  attaches  the  new  node  to  the  tree, 
and  returns  control  to  the  node  on  the  top  of  the  POL.  If  It 
falls.  It  restores  the  tree  to  Its  state  before  the  program  was 
called,  then  returns  control.  A  PROGRAMMAR  program  Is  actually 
converted  by  a  simple  compiler  to  a  LISP  program  and  run  In  that 
form.  The  variables  and  functions  available  for  writing 
PROGRAMMAR  programs  are  described  In  the  rest  of  part  2.4.  In 
order  to  make  these  details  more  Independent  of  our  detailed 
grammar  of  English,  we  will  continue  to  use  a  simplified  grammar 
whenever  possible.  We  use  the  hypothetical  grammar  begun  In 
2.2,  and  try  to  use  full  length  feature  names  for  easier 
unders'.anding. 
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When  the  function  PARSE  Is  called  with  a  first  argument 
which  has  not  been  defined  as  a  PROGRAMMAR  program#  It  checks  to 
see  whj:her  the  next  word  has  all  of  the  features  listed  In  the 
arguments.  If  so#  It  forms  a  new  node  pointing  to  that  word# 
with  a  list  of  features  which  Is  the  Intersection  of  the  list  of 
features  for  that  word  with  the  allowable  features  for  the  word 
class  Indicated  by  the  first  argument  of  the  call.  For  example# 
the  word  "blocks"  will  have  the  possibility  of  being  either  a 
plural  noun  or  a  th I rd-person-s I ngul ar  present-tense  verb. 
Therefore#  before  any  parsing  It  will  have  the  features  (NOUN 
VERB  N-PL  VB-3PS  TRANSITIVE  PRESENT).  If  the  expression  (PARSE 
VERB  TRANSITIVE)  Is  evaluated  when  "blocks"  Is  the  next  word  In 
the  sentence  to  be  parsed#  the  feature  list  of  the  resulting 
node  will  be  the  Intersection  of  this  combined  list  with  the 
list  of  allowable  features  for  the  word-class  VERB.  If  we  have 
def I ned: 

(DEFPROP  VERB  (VERB  INTRANSITIVE  TRANSITIVE  PRESENT  PAST 
VB-3PS  VB-PL)  ELIM)# 

the  new  feature  list  will  be  (VERB  TRANSITIVE  PRESENT  VB- 
3PS).  (ELIM  Is  simply  a  property  Indicator  chosen  to  Indicate 
this  list  which  ELIMtnates  features).  Thus#  even  though  words 
may  have  more  than  one  part  of  speech#  when  they  appear  In  the 
parsing  tree#  they  will  exhibit  only  those  features  relevant  to 
their  actual  use  In  the  sentence. 
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IjJLkl  Special  Words 

Some  words  must  be  handled  In  a  very  special  way  In  the 
grammar.  The  most  prevalent  are  conjunctions/  such  as  “and"  and 
“but".  When  one  of  these  Is  encountered/  the  normal  process  Is 
Interrupted  and  a  speclt.1  program  Is  called  to  decide  what  steps 
should  be  taken  In  the  parsing.  This  Is  done  by  giving  these 
words  the  grammatical  features  SPEC  or  SPECL.  Whenever  the 
function  PARSE  Is  evaluated/  before  returning  It  checks  the  next 
word  In  the  sentence  to  see  If  It  has  the  feature  SPEC.  If  sO/ 
the  SPEC  property  on  the  property  list  of  that  word  Indicates  a 
function  to  be  evaluated  before  parsing  continues.  This  program 
can  In  turn  call  PROGRAMMAR  programs  and  make  an  arbitrary 
number  of  changes  to  the  parsing  tree  before  returning  control 
to  the  normal  parsing  procedure.  SPECL  has  the  same  effect/  but 
Is  checked  for  when  the  function  PARSE  Is  called/  rather  than 
before  ft  returns.  Various  other  special  variables  and 
functions  allow  these  programs  to  control  the  course  of  the 
parsing  process  after  they  have  been  evaluated.  By  using  these 
special  wordS/  It  Is  possible  to  write  amazingly  simple  and 
efficient  programs  for  some  of  the  aspects  of  grammar  which 
cause  the  greatest  difficulty.  This  Is  possible  because  the 
general  form  of  the  grammar  Is  a  program. 

For  example/  “and"  can  be  defined  as  a  program  which  Is 


diagrammed 
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Parse  a  unit  of  the  same  type 
as  the  currently  active  node  "" 

I' 

Replace  the  node  with  a  new  node 
combining  the  old  one  and  the  one 
you  have  just  found 

Return  success 

Figure  49  --  Conjunction  Program 


For  example,  given  the  sentence  "The  giraffe  ate  the  apples 
and  peaches."  the  program  would  first  encounter  "and"  after 
parsing  the  NOUN  apples.  It  would  then  try  to  parse  a  second 


NOUN,  and  '.vould  succeed,  resulting  In  the  structure: 


"^Return  failure 


If  we  had  the  sentence,  "The  giraffe  ate  the  apples  and 
drank  the  vodka."  th<»  parser  would  first  try  the  same  thing. 
However,  "drank"  Is  not  a  NOUN,  so  the  AND  program  would  fall 
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and  the  NOUN  "apples"  m/ould  be  returned  unchanged.  This  would 
cause  the  NP  "the  apples"  to  succeed,  so  the  AND  program  would 
be  called  again.  It  would  fall  to  find  a  NP  beginning  with 
"drank",  so  the  NP  "the  apples"  would  be  returned,  causing  the 
VP  to  succeed.  This  time,  AND  would  try  to  parse  a  VP  and  would 
find  "drank  the  vodka".  It  would  therefore  make  up  a  combined 
VP  and  cause  the  entire  SENTENCE  to  be  completed  with  the 


structure: 


The  program  to  actually  do  this  would  take  only  3  or  4 
lines  In  a  PROGRAMMAR  grammar.  In  the  actual  system,  it  Is  more 
complex  as  It  handles  lists  (like  "A,  B,  and  C")  other 
conjunctions  (such  as  "but")  and  special  constructions  (such  as 
"both  A  and  B").  The  conjunction  program  Is  called  by  all  of 
the  conjunctions,  the  words  "either",  "neither",  and  "both",  and 
the  mark  ","  which  appears  as  a  separate  word  In  the  Input. 

The  function  **  Is  used  to  look  ahead  for  a  repetition  of 
the  special  word,  as  In  ". . .and. . .and. . .".  If  one  Is  found,  a 
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unit  of  the  type  most  recently  parsed  Is  parsed  again,  trying  to 
extend  all  the  way  to  the  repeated  conjunction  or  comma.  This 
Is  Iterated  as  long  as  there  are  repetitions,  with  special 
checks  made  for  structures  like  "A,  B,  and  C"  or  "A  and  B  but 
not  C".  As  each  new  node  Is  parsed.  Its  structure  Is  saved,  and 
when  the  last  Is  found,  a  new  node  Is  created  for  the  compound. 
Its  features  combine  those  for  the  type  of  conjunction  with 
those  appropriate  for  the  type  of  unit  (e.g.  a  compound  NG 
connected  with  "and"  Is  given  the  feature  "plural"  (NPD.)  The 
list  of  constituent  structures  Is  put  on  the  tree  as  a  list  of 
subnodes  of  the  conjoined  structure,  which  then  replaces  the 
original  unit  on  the  parsing  tree. 

Compounds  with  a  preceding  word  like  "both"  are  parsed 
differently,  since  the  word  Is  encountered  before  any  unit  has 
been  parsed.  In  this  case  It  Is  possible  to  adopt  the  more 
general  philosophy  of  attempting  the  longest  possible  unit 
first.  These  words  have  a  SPECL  definition,  so  the  program  Is 
called  as  the  next  unit  Is  about  to  be  parsed.  The  conjunction 
program  looks  for  the  matching  conjunction  ("and"  with  "both", 
"or"  with  "either",  and  "nor"  with  "neither")  and  tries  to  parse 
the  unit  expending  only  to  the  conjunction.  If  this  succeeds, 
the  normal  conjunction  procedure  Is  followed.  If  not,  some  sub¬ 
component  Is  the  conjoined  one,  and  nothing  happens  until  the 
parser  attempts  a  sub-unit,  when  the  process  Is  repeated. 

A  SPECL  program  can  modify  the  parsing  In  several  ways. 
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Por  example  It  can  call  the  function  FLUSHME^  which  simply 
removes  the  word  from  the  Input  sentence  (I.e.  It  Is  Ignored). 

It  can  take  arbitrary  actions  on  the  current  parsing  tree^  can 
Indicate  to  PROGRAMMAR  that  It  should  SKIP  parsing  the  unit  and 
use  Instead  results  provided  by  the  SPECL  program^  or  It  can 
Indicate  an  action  to  be  taken  after  the  normal  parsing  Is  DONE. 
Finally#  a  SPEC  or  SPECL  program  can  abort  the  entire  parsing# 
Indicating  a  response  to  the  user.  For  example#  the  word 
"thank"  calls  a  SPECL  program  which  checks  to  see  If  the  next 
word  Is  "you".  If  so#  the  parsing  Is  given  up#  and  the  system 
replies  "YOU'RE  WELCOME".  Currently  there  Is  no  backup 
procedure  to  modify  the  Interpretation  of  an  ambiguous  structure 
like  "A  and  B  or  C".  This  will  In  fact  be  parsed  as  (A  and  <B 
or  O).  Notice  that  "either  A  and  B  or  C"  will  be  parsed 
correctly  as  ((A  and  B)  or  C) . 

The  exact  format  for  a  SPEC  or  SPECL  definition  Is  a  LISP 
list  to  which  will  be  appended  two  I  terns  --  the  Initial  feature 
list  of  the  unit  being  parsed  and  an  Indicator  of  whether  It  Is 
a  word  or  a  unit  which  called  the  program.  The  resultant  form 


Is  then  EVALIed 
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Uul  £Q?&.es.s  Lves 

One  of  the  best  examples  of  the  advantages  of  procedural 
grammars  Is  the  ability  to  handle  left-branching  structures  like 
possessives.  In  a  normal  top-down  parser,  these  present 
difficulties,  since  any  NG  can  begin  with  a  possessive  NG,  which 
can  In  turn  begin  with  a  possessive  NG,  etc.,  as  In  "my  mother's 
sister's  student's  cat's  fur".  Special  care  must  be  taken  to 
avoid  Infinite  loops. 

In  our  granvnar  this  Is  handled  by  a  check  after  the  NOUN  or 
PRONOUN  Is  found  In  a  NG.  If  It  has  the  feature  "possessive" 
(POSS)  (e.g.  "my"  or  "block's")  a  node  Is  created  for  the  NG 
thus  far  parsed,  and  this  Is  placed  on  the  tree  as  a  constituent 
(the  determiner)  of  a  NG  to  be  continued.  The  program  then 
returns  to  the  point  where  It  was  after  finding  a  determiner, 
and  continues  looking.  This  can  happen  any  number  of  times,  but 
In  each  case  Is  triggered  by  the  presence  of  another  POSSesslve 
word.  It  therefore  loops  only  as  much  as  necessary.  This 
departure  from  top-down  parsing  Involves  no  changes  to  the 
parser,  and  only  a  simple  loop  In  the  program.  Any  other  left¬ 
branching  structure  can  be  handled  similarly. 


n 
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Ihs.  1  let! onarv 

Since  PROGRAMMAR  Is  embedded  In  LISP,  the  facilities  of 
LISP  for  handling  atom  names  are  used  directly.  To  define  a 
word,  a  list  of  grammatical  features  Is  put  on  Its  property  list 
under  the  Indicator  VIORD,  and  a  semantic  definition  under  the 
Indicator  SMKTC.  Two  facilities  are  Included  to  avoid  having  to 
repeat  Information  for  different  forms  of  the  same  word.  First, 
there  Is  an  alternate  way  of  defining  words,  by  using  the 
property  Indicator  WORDl.  This  Indicates  that  the  word  given  Is 
an  Inflected  form,  and  Its  properties  are  a  modified  form  of  tiie 
properties  of  Its  root.  A  WORDl  definition  has  three  elements, 
the  root  word,  the  list  of  features  to  be  added,  and  the  list  of 
features  to  be  removed.  For  example,  we  might  define  the  word 
"go"  by:  (DEFPROP  GO  (VERB  INTRANSITIVE  INFINITIVE)  WORD)  We 
could  then  define  "went"  as  (DEFPROP  WENT  (GO 
(PAST) ( INFINITIVE))  WORDl)  This  Indicates  that  the  feature 
INFINITIVE  Is  to  be  replaced  by  the  feature  PAST,  but  the  rest 
(Including  the  semantic  definition)  Is  to  remain  the  same  as  for 
"go". 

The  other  facility  Is  an  automatic  system  which  checks  for 
simple  modifications,  such  as  plurals,  "-Ing,"  forms,  "-er"  and 
"-est"  forms  and  so  forth.  If  the  word  as  typed  In  is  not 
defined,  the  program  looks  at  the  way  It  Is  spelled,  tries  to 
remove  Its  ending  (taking  Into  account  rules  such  as  changing 
"running"  to  "run",  but  "buzzing"  to  "buzz"). 


It  then  tries  to 


Section  2.4.4  -  Page  185 


flno  a  definition  for  the  reduced  root  word,  and  If  It  succeeds. 
It  makes  the  appropriate  changes  for  the  ending  (such  as 
changing  the  feature  SINGULAR  to  PLURAL).  The  program  which 
does  this  Is  not  a  part  of  the  PROGRAMMAR  system  but  Is 
specifically  built  for  English.  It  Is  described  In  section 
2.3.12. 

Everything  else  described  In  this  section  Is  designed 
ganerdlly  for  the  parsing  of  any  language.  In  any  particular 
language,  this  Input  funtlon  would  have  to  be  written  according 
to  the  special  rules  of  morphographemlc  structure.  The 
requirement  for  such  a  program  is  that  Its  output  must  be  a 
list,  each  member  of  which  corresponds  to  a  word  In  the  original 
sentence,  and  Is  in  the  form  described  In  section  2.4.8.  This 
list  Is  bound  to  the  variable  SENT,  and  is  the  way  In  which 
PROGRAMMAR  sees  Its  Input. 

The  other  form  of  data  In  the  dictionary  Is  tables  of  verb- 
particle  and  preposition-preposition  combinations  like  "pick  up" 
or  "on  top  of".  The  table  Is  stored  on  the  property  list  of  the 
initial  word  under  the  Indicator  PRTT  or  PREPP  respectively.  It 
Is  an  association  list,  each  member  of  which  has  the  second  word 
of  the  combination  as  Its  CAR,  and  a  word  parsing  structure  (see 
section  2.4.8)  as  its  CADR.  There  may  be  more  than  one 
combination  for  the  same  Initial  word  (e.g.  "pick  up",  "pick 
out"),  and  a  three-word  combination  can  be  defined  by  making  the 
CADR  be  an  association  list  of  the  same  form  for  a  third  word. 
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2^4.5  Backup  Fact! I  ties 

As  explained  In  section  2.2.7^  there  l»  no  automatic 
backup,  but  there  are  a  number  of  special  functions  which  can  be 
used  In  writing  grammars.  The  simplest,  (POPTO  X)  simply 
removes  nodes  from  the  tree.  The  argument  Is  a  list  of 
features,  and  the  effect  Is  to  remove  daughters  of  the  currently 
active  node,  beginning  with  the  rightmost  and  working  leftward 
until  one  Is  reached  with  all  of  those  features.  (POP  X)  Is  the 
same,  except  that  It  also  removes  the  node  with  the  Indicated 
features.  If  no  such  node  exists,  neither  function  takes  any 
action.  (POP)  Is  the  same  as  (POP  NIL),  and  a  non-nil  value  Is 
returned  by  both  functions  If  any  action  has  been  taken. 

A  very  Important  feature  Is  the  CUT  variable.  One  way  to 
do  backup  Is  to  first  try  to  find  the  longest  possible 
constituent  at  any  point,  then  If  for  any  reason  an  Impasse  Is 
reached,  to  return  and  try  again  ,  limiting  the  consituent  from 
going  as  far  along  In  the  sentence.  For  example.  In  the 
sentence  “Was  the  typewriter  sitting  on  the  cake?",  the  parser 
will  first  find  the  auxllllary  verb  "was",  then  try  to  parse  the 
subject.  It  will  find  the  noun  group  "the  typewriter  sitting  on 
the  cake",  which  In  another  context  might  well  be  the  subject 
("the  typewriter  sitting  on  the  cake  Is  broken.").  It  then 
tries  to  find  the  verb,  and  discovers  none  of  the  sentence  Is 
left.  To  back  up.  It  must  change  the  subject.  A  very  clever 
program  would  look  at  the  structure  of  the  noun  group  and  would 


Section  2.4.5  -  Page  187 


realize  that  the  modifying  clause  "sitting  on  the  cake"  must  be 
dropped.  A  more  simple-minded  but  still  efTectlve  approach 
would  use  the  following  Instructions: 

(**  N  PW) 

(POP) 

((CUT  PTW)SUBJECT  (ERROR)) 

The  first  command  sets  the  pointer  PTW  to  the  last  word  In 
the  constituent  (In  this  case,  "cake").  The  next  removes  that 
constituent.  The  third  sets  a  special  pointer/  CUT  to  that 
location/  then  sends  the  program  back  to  the  point  where  It  was 
looking  for  a  subject.  It  would  now  try  to  find  a  subject 
again/  but  would  not  be  allowed  to  go  as  far  as  the  word  "cake". 
It  might  now  find  "the  typerwriter  sitting/"  an  analog  to  "The 
man  sitting  Is  my  uncle."  If  there  were  a  good  semantic 
program/  It  would  realize  that  the  verb  "sit"  cannot  be  used 
with  an  Inanimate  object  without  a  location  specified.  This 
would  prevent  the  constituent  "the  typewriter  sitting"  from  ever 
being  parsed.  Even  If  this  does  not  happen/  the  program  would 
fall  to  find  a  verb  when  It  looked  at  the  remaining  sentence/ 

"on  the  cake."  By  going  through  the  cutting  loop  again/  It  wo  ild 
find  the  proper  subject/  "the  typewriter/"  and  would  continue 
through  the  sentence. 

Once  a  CUT  point  has  been  set  for  any  active  node/  no 
descendant  of  that  node  can  extend  beyond  that  point  until  the 
CUT  Is  moved.  Whenever  a  PROGRAMMAR  program  Is  called/  the 
varlaMe  END  Is  set  to  the  current  CUT  point  of  the  node  which 
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called  It.  The  CUT  point  for  each  constituent  Is  Initially  set 
to  Its  E|;D.  When  the  function  PARSE  Is  called  for  a  word.  It 
first  checks  to  see  If  the  current  CUT  has  been  reached  (I.e.  N 
and  CUT  are  the  same),  and  If  so  It  falls.  The  third  branch  In 
a  three-direction  branch  statement  Is  taken  If  the  current  CUT 
point  has  been  reached.  The  CUT  pointer  Is  set  with  the 
function  CUT  of  one  argument. 
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Auxin  tarv  Functions 

Since  PROGRAMMAR  grammars  are  programs#  they  can  call 
subroutines  just  as  any  other  program.  These  subroutines  can 
use  PROGRAMMAR  primitives  like  PARSE  and  *,  as  well  as  returning 
values  for  use  In  the  main  program.  One  example  used  In  our 
grammar  Is  UPCHECK#  used  to  see  If  the  current  node  Is  embedded 
In  a  structure  which  could  be  an  UPREL  like  "the  man  I  wanted 
you  to  see."  It  Is  used  In  conjunction  with  UPMOD  which  makes 
the  appropriate  changes  to  the  parsing  tree.  They  both  use 
primitives  like  *  to  find  and  change  the  elements. 

In  order  to  simplify  the  search  for  rank-shifted  clauses#  a 
function  PARSEREL  was  written.  It  takes  as  arguments  a  list  of 
clause  types  (like  REPORT#  ING#  etc.)  a  corresponding  list  of 
features  to  look  for  an  the  mam  verb#  a  pointer  to  that  verb# 
and  the  rest  of  the  Information  to  be  Included  In  the  call  to 
PARSE.  PARSEREL  then  loops  through  these  lists#  attempting  to 
parse  various  types  of  RSNG  clauses  If  they  are  In  accord  with 
the  restrictions  associated  with  the  verb  and  the  use  of  the 
clause  In  the  sentence.  It  uses  the  function  PARSE  to  modify 
the  parsing  tree  before  returning  to  the  main  CLAUSE  program. 


Section  2.4.7  -  Page  190 


laJLj.  Messages 

To  write  good  parsing  programs#  we  may  at  times  want  to 
know  why  a  particular  PROGRAMMAR  program  failed#  or  why  a 
certain  pointer  command  could  not  be  carried  out.  In  order  to 
facilitate  this#  two  message  variables  are  kept  at  the  top  level 
of  the  system#  MES#  and  MESP.  Messages  can  be  put  on  MES  In  two 
ways#  either  by  using  the  special  failure  directions  In  the 
branch  statements  (see  section  2.2.5)  or  by  using  the  functions 
M  and  MQ#  which  are  exactly  like  F  and  FQ,  except  they  put  the 
Indicated  feature  onto  the  message  list  ME  for  tliat  unit.  Vi/hen 
a  u;i  1 1  returns  either  failure  or  success#  MES  Is  bound  to  the 
current  value  of  ME#  so  the  calling  program  can  receive  an 
arbitrary  list  of  messages  for  whatever  purpose  It  may  -ai.t 
then.  MESP  always  contains  the  last  failure  message  received 


from  or  *. 


Section  2.4.8  -  Page  191 


l^jul  Ih£  £2xid  iL£  lh£.  Par.s,.ljig  Iras 

Each  node  Is  actually  a  list  structure  with  the  following 
I nformat I  on: 


FE 

NB 

N 

H 


SM 


the  list  of  features  associated  with  the  node 
the  place  In  the  sentence  where  the  constituent 
begins 

the  place  Immediately  after  the  constituent 
*he  subtree  below  that  node  (actually  a  list  of 
Its  daughters  In  reverse  order#  so  that 
H  points  to  the  last  constituent  parsed) 
a  space  reserved  for  semantic  Information 


These  symbols  can  be  used  In  two  ways.  If  evaluated  as 
variables#  they  will  always  return  the  designated  Information 
for  the  currently  active  node.  C  Is  always  a  pointer  to  that 
node.  If  used  as  functions  of  one  argument#  they  give  the 
appropriate  values  for  the  node  pointed  to  by  that  argument;  so 
(NB  H)  gives  the  location  In  the  sentence  of  the  first  word  of 
the  last  constituent  parsed#  while  (FE(NB  H))  would  give  the 
feature  list  of  that  word. 

Each  word  In  the  sentence  Is  actually  a  list  structure 
containing  the  4  items: 

FE  as  above 

SMWORO  the  semantic  definition  of  the  word 

WORD  the  word  Itself  (a  pointer  to  an  r.tom) 

ROOT  the  root  of  the  word  (e.g.  "run"  If  the 

word  Is  "running"). 
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2.4.9  Variables  Ma 1 hta 1 ned  by  the  System 

There  >re  two  types  of  yarlables,  those  bound  at  the  top 


level,  and  those  which  are  rebound  every  time  a  PROGHAMMAR 
program  is  called. 


Variables  bound 
N 

SENT 
PT  PTW 

Ml'S  MESP 


at  the  top  level 

Always  points  to  next  word  In  the 
sentence  to  be  parsed 
Always  points  to  the  entire  sentence 
Tree  and  sentence  pointers. 

See  Section  2.4.10 

List  of  messages  passed  up  from  lower 
levels.  See  Section  2.4.7 


Special  variables  bound  at  each  level 


C  FE  NB  SM  H 
NM  CUT  END 

UNIT 

REST 


T1  T2  T3 

MVB 

Ml' 


See  section  2.4.8 
See  section  2.4.5.  MM  always 
equals  (N0T(EQ  N  CUT)) 
the  name  of  the  currently  active 
PROGRAMMAR  program 
the  list  of  arguments  for  the  call 
to  PARSE  (These  form  the  Initial 
feature  list  for  the  node,  but  as 
other  features  are  added,  REST 
continues  to  hold  only  the  original 
ones . ) 

Three  temporary  PROG  variables  for  use 
by  the  program  In  any  way  needed. 
Bound  only  when  a  CLAUSE  Is  parsed 
used  as  a  pointer  to  the  main  verb 
List  of  messages  to  be  passed  up  to 
next  level  See  Section  2.4.7 
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Pointers 

The  system  always  maintains  two  pointers/  PT  to  a  place  on 
the  parsing  tree#  and  PTIV  to  a  place  In  the  sentence.  These  are 
moved  by  the  functions  *  and  ♦*  respectively/  as  explained  In 
section  2.2.10.  The  Instructions  for  PT  are: 


C 

H 

DL 

DLC 


OF 

PV 

NX 

U 

N 


set  PT  to  the  currently  active  node 
set  PT  to  most  recent  (rightmost)  daughter  of  C 
(down-last)  move  PT  to  the  rightmost  daughter 
of  Its  current  value 

(down-last  completed)  like  01/  except  It  only 
moves  to  nodes  which  are  not  on  the  push-down 
list  of  active  nodes. 

(down-first)  like  OL/  except  the  leftmost 
(previous)  move  PT  to  Its  1 ef t^adjacent  sister 
(next)  move  PT  to  Its  right-adjacent  sister 
(up)  move  PT  to  parent  node  of  Its  current  value 
Move  PT  to  next  word  In  sentence  to  be  parsed 


The  pointer  PTW  always  points  to  a  place  In  the  sentence. 


It  Is  moved  by  the  function  *♦  which  has  the  sa-^e  syntax  as  */ 
and  the  commands: 


N  Sdt  PTW  to  the  next  word  In  the  sentence 

FW  (first-word)  set  PTW  to  the  first  word  of  the 

constituent  pointed  to  by  PT 
LW  (last-word)  like  FW 

AW  (after-word)  like  FW/  but  first  word  after  the 

constituent 

NW  (next-word)  Set  PTW  to  the  next  word  after  Its 

current  value 

PW  (previous-word)  like  NW 

SFW  (sentence-first-word)  set  PTW  to  the  first  word 

In  tne  sentence 

SLW  (sentence-last-word)  like  SFW 

Since  the  pointers  are  bound  at  the  top  level/  a  program 
which  calls  others  which  move  the  pointers  may  want  to  preserve 
their  location.  PTV\I  Is  a  simple  variable/  and  can  be  saved  with 
a  SETQ/  but  PT  operates  by  keeping  track  of  the  way  It  has  been 
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moved.  In  order  to  be  able  to  retrace  Its  steps.  This  Is 
necessary  since  LISP  lists  are  threaded  In  only  one  direction 
(In  this  case,  from  the  parent  node  to  Its  daughters,  and  from  a 
right  sister  to  its  left  sister).  The  return  path  Is  bound  to 
the  variable  PTR,  and  the  command  (PTSV  X)  saves  the  values  of 
both  PT  anti  PTR  under  the  variable  X,  while  (PTRS  X)  restores 
both  values. 
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2.^ .11  ££a.tur9  Manipulating 

As  explained  In  section  2.2.6#  we  must  be  able  to  attach 
features  to  nodes  In  the  tree.  The  functions  F#  FQ#  and  TRNSF 
are  used  for  putting  features  onto  the  current  node#  while  R  and 
RQ  remove  them.  (F  A)  sets  the  feature  list  FE  to  the  union  of 
Its  current  value  with  the  list  of  features  A.  (FQ  A)  adds  the 
single  feature  A  (I.e.  It  quotes  Its  argument).  (TRNSF  A  B)  was 
explained  In  Section  2.2.7.  R  and  RQ  are  Inverses  of  F  and  FQ. 
The  functions  ISX#  ISQ#  CQ#  and  NQ  ire  used  to  examine  features. 
If  A  points  to  a  node  of  the  tree  or  word  of  the  sentence#  and  B 
points  to  a  feature#  (ISX  A  B)  returns  non-nil  If  that  node  has 
that  feature.  (ISQ  A  B)  Is  equivalent  to  (IS  A  (QUOTE  B))#  (CQ 
B)  Is  the  same  as  (ISQ  C  6)  (where  C  always  points  to  the 
currently  active  node)#  and  (NQ  B)  is  the  same  as  (ISQ  N  B)  (N 
always  points  to  the  next  word  In  the  sentence  left  to  be 
parsed) . 

The  function  NEXTW  checks  to  see  of  the  root  of  the  next 
wrrd  matches  the  argument.  (NEXTW  BE)  evaluates  to  non-NIL  omy 
If  the  next  word  Is  some  form  of  the  verb  "be**.  PUTF  and  REMF 
are  used  to  add  and  remove  features  from  some  node  other  than 
the  current  one.  They  are  FEXPRS  whose  argument  Is  a  list  of 
features#  which  are  put  on  or  removed  from  the  node  currently 
pointed  to  by  the  pointer  PT. 
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2.5  Comparison  with  Other  Parsers 
1^5 . 1  OJ_der  Pjir  sor  s 

Vihef'  work  first  began  on  analyzing  natural  language  with 
computers^  no  theories  of  syntax  existed  v/hich  were  explicit 
enough  to  be  used.  The  early  mach I ne- trans 1 ator  designers  were 
forced  to  develop  their  ovjn  linguistics  as  they  worked^  and  they 
produced  rough  and  ready  versions.  The  parsers  were  collections 
of  "packaging  routines”,  "Inserted  structure  passes",  "labeling 
subroutines",  etc.  (see  <Gorvln>)  which  evolved  gradually  as 
the  grammars  were  expanded  to  handle  more  and  more  complex 
sentences.  They  had  the  same  difficulties  as  any  program 
designed  in  this  wav  --  as  they  became  more  complex  It  becam.e 
harder  and  harder  to  un'jerstand  the  Interactions  within  them. 
Making  extensions  which  v/ere  Intended  to  deal  with  a  limited 
anticipated  set  of  Inputs  tended  to  make  It  difficult  to  extend 
the  system  later. 

When  the  machine-translation  effort  failed.  It  seemed  clear 
that  it  had  been  premature  to  try  handling  all  of  English 
without  a  better  background  of  r.ngulstlc  theory  and  an 
understanding  of  the  mathematical  properties  of  grammars. 
Computer  programs  for  natural  language  took  tv/o  separate  paths. 
The  first  was  to  Ignore  traditional  syntax  entirely,  and  to  use 
some  sort  of  more  general  pattern  matching  process  to  get 
Information  out  of  sentences.  Systems  such  as  STUDENT,  SIR, 
ELIZA,  and  Semantic  Memory  made  no  attempt  do  a  complete 
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syntactic  analysis  of  the  Inputs.  They  either  limited  tl.e  user 
to  a  small  set  of  fixed  Input  forms  or  limited  their 
understand i ng  to  those  things  they  could  get  while  Igro-ing 
syntax. 

The  other  approach  was  to  taka  a  simplified  subset  of 
English  which  could  be  handled  by  a  wel 1 -unders tood  form  of 
grammar,  such  as  one  of  the  variations  of  context-free  grammars. 
There  has  been  much  Interesting  research  on  the  proper.les  of 
abstract  languages  and  the  algorithms  needed  to  parse  them. 

Using  this  theory,  a  series  of  parsing  algorithms  and 
representations  v^ere  developed.  For  a  summary  of  the  computer 
parsers  designed  before  1966,  see  <Bobrow  1964>.  A  n:ore  recent 
development  was  Early's  context-free  parser  <Ear1y>  which 
operates  In  a  time  proportional  to  the  cube  of  the  length  of  a 
sentence . 

The  problem  faced  by  all  of  these  parsers  (Including  the 
mammoth  Harvard  Syntactic  Analyzer  (<Kuno>).  Is  that  such 
simple  models  arc  not  adequate  for  handling  the  full  complexity 
of  natural  language.  This  Is  discussed  theoretically  In 
<Chomsl<:y  1957>  but  for  our  purposes  It  Is  more  Important  to  note 
that  many  aspects  which  could  theoretically  be  handled  would  be 
included  only  at  the  expense  of  gross  Inefficiency  and 
unnecessary  complexity. 

Several  people  attempted  to  use  Chomsky's  transformational 
grammar  as  the  basis  for  parsers,  (see  <PetrIck>  and  <Zwlcky>) 
They  tried  to  "unwind"  the  transformations  to  reproduce  the  deep 
structure  of  a  sentence,  which  could  then  be  parsed  by  a  context 
free  "base  component".  It  soon  became  apparent  that  this  was  a 
very  difficult  task.  Although  transformational  grammar  Is 
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theoretically  a  "neutral"  description  of  language^  It  Is  In 
fact  highly  biased  toward  the  process  of  generating  sentences 
rather  than  Interpreting  them.  Adapting  generation  rules  to  use 
In  interpretation  Is  relatively  easy  for  a  context-free  grammar, 
but  extremely  difficult  for  transformational  gramniars.  <Woods 
1969>  discusses  the  problems  of  "comb  1 na tor  I al  explosion" 
Inherent  in  the  Inverse  transformational  process.  The 
transformational  parsers  have  not  gone  beyond  the  stage  of 
handling  small  subsets  of  English  In  an  Inefficient  way. 
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Z .  ?  .  2  Au.-imcnted  Transition  Networks 

In  the  past  two  years#  three  related  parsing  systems  have 
been  developed  to  deal  with  the  full  complexity  of  natural 
language.  The  first  was  by  Thorne#  Bratley#  and  Dewar  (<Thorne 
1968  and  1969>)#  and  the  more  recent  ones  are  by  Bobrow  and 
Fraser  (<Sobrow  1969>)  and  Woods  (<V/oods  1969>).  Tht  three 
programs  operate  In  very  similar  ways#  and  since  Woods'  Is  the 
most  advanced  and  best  documented#  we  will  use  It  for 
comparison.  In  his  paper  Woods  compares  his  system  with  the 
other  two. 

The  basic  Idea  of  these  parsers  Is  the  "augmented 
transition  network".  The  parser  Is  seen  as  a  transition  network 
much  like  a  finite-state  recognizer  used  for  regular  languages 
In  automata  theory. 

The  first  extension  Is  In  allowing  f'.e  networks  to  make 
recursive  calls  to  other  networks  (or  to  themselves).  The 
condition  for  following  a  particular  state  transition  Is  not 
limited  to  examining  a  single  Input  symbol.  The  condition  on 
the  arc  can  be  something  like  "NP"  where  NP  Is  the  name  of  an 
Initial  state  of  another  network.  This  recursively  called  NP 
network  then  examines  the  Input  and  operates  as  a  recognizer. 

If  It  ever  reaches  an  accepting  state#  It  stops#  and  parsing 
continues  from  the  end  of  the  NP  arc  In  the  original  network. 
These  "recursive  transition  networks"  have  the  power  of  a 
context-free  grammar#  and  the  corras^ ; -"cg  between  a  network 
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and  Its  equivalent  grammar  Is  quite  simple  and  direct. 

To  parse  the  full  range  of  natural  language,  we  need  a 
critical  addition.  Instead  of  using  "recursive  transition 
networks"  these  parsers  use  "augmented  transition  networks", 
which  can  "make  changes  In  the  contents  of  a  set  of  registers 
associated  with  the  network,  and  whose  transitions  can  be 
conditional  on  the  contents  of  those  registers.  (<Woods  1960>). 
This  Is  done  by  "adding  to  each  arc  of  the  transition  network  an 
arbitrary  condition  which  must  be  satisfied  In  order  for  the  arc 
to  be  followed,  and  a  set  of  structure  building  actions  to  be 
executed  If  the  arc  Is  followed." 

Augmented  tran:, Itlon  networks  have  the  power  of  Turing 
machines  (since  they  have  changeaole  registers  and  can  transfer 
control  depending  on  the  state  of  those  registers).  Clearly 
they  can  handle  any  type  of  grammar  which  could  possibly  be 
parsed  by  any  machine.  The  advantages  lie  In  the  ways  In  which 
these  augmented  networks  are  close  to  the  actual  operations  of 
language,  and  give  a  natural  and  understandable  representation 
for  grammars. 
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Networks  Pror.roms 

How  does  this  "ype  of  parser  compare  with  PROHRAMMAR?  Is 
there  anything  In  common  between  grammars  which  are  networks  and 
grammars  which  are  programs?  The  reader  may  have  already  seen 
the  **joke”  in  this  question.  In  fact  these  are  just  tvo 
different  ways  of  talking  about  doing  exactly  the  same  thingl 

Picture  a  flowchart  for  a  PROGRAMMAR  grammar^  In  v/hfch 
calls  to  the  function  PARSE  are  drawn  on  the  arcs  rather  chan  at 
tiie  nodes.  Every  arc  then  Is  either  a  reauest  to  accept  the 
next  word  In  the  Input  (when  the  argument  of  PARSE  Is  a  word 
class)^  or  a  recursive  call  to  one  of  the  grammar  prograrns,  Ac 
each  node  (l.e.  segment  of  program  between  conditionals  and 
PARSE  calls)  we  have  set  of  arbitrary  structure  building 
action;. Our  flowchart  Is  Just  like  an  augmented  transition 
network. 

Mow  picture  how  Woods*  networks  are  fed  to  the  computer. 

He  uses  a  notation  (see  <Woods  1969>  p.  17)  which  looks  very 
much  like  a  L I SP-embedded  computer  language^  such  as  PROGRAMMAR 
or  PLANNER.  In  fact^  the  networks  could  be  translated  almost 
directly  into  PLANNER  programs  (PLANNER  rather  than  LISP  or 
PROGRAMMAR  because  of  the  automatic  backup  features  --  see 
discussion  below). 

It  Is  an  Interesting  lesson  In  computer  science  to  look  at 
Woods*  discussion  of  the  advantages  of  networks,  and  **t rans  1  ate** 
them  Into  the  advantages  of  programs.  For  example,  he  talks 
about  efficiency  of  representation.  **A  major  advantage  of  the 
transition  network  model  Is... the  ability  to  merge,  the  common 
parts  of  many  context  free  rules,** 

Looking  at  grammars  as  programs,  v/e  can  call  this  **sharlng 
subroutines**.  He  says  "The  augmented  transition  netv/ork, 
through  its  use  of  flags  allows  for  the  merging  of  similar  parts 
of  the  network  by  recording  Information  In  registers  and 
Interrogating  It. ..and  to  merge  states  whose  transitions  are 
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similar  except  for  conditions  on  the  contents  of  the  registers.** 
This  Is  the  use  of  subroutines  with  parameters.  In  addition/ 
the  networks  can  '*capture  the  regularities  of  the 
1  anguoge . . .  v/henever  there  arc  two  essentially  Identical  parts  of 
the  grammar  which  differ  only  In  that  the  finite  control  part  of 
the  machine  Is  remembering  some  piece  of  1 nformat Ion. . . I t  Is 
sufficient  to  explicitly  store  the  distinguishing  piece  of 
Information  In  a  register  and  use  only  a  single  copy  of  the 
subgraph.**  This  Is  clearly  the  use  of  subroutines  with  an 
argument ! 

Similarly  v/e  can  go  through  the  arguments  about  efficiency, 
the  ease  of  mixing  semantics  with  syntax,  the  ability  to  Include 
operations  which  are  **natural**  to  the  task  of  natural  language 
analysis,  etc.  All  of  them  apply  Identically  whether  we  are 
looking  at  **transltlon  networks**  or  "programs**. 

What  about  **persplcu I  ty"7  Woods  claims  that  augmented 
transition  networks  retain  the  perspicuity  (ease  of  reading  and 
understanding  by  humans)  of  simpler  grammar  forms.  He  says  that 
transformational  grammars  have  the  problem  that  *'the  effect  of  a 
given  rule  Is  Intimately  bound  up  with  Its  Interrelation  to 
other  rules. ..It  may  require  an  extremely  complex  analysis  to 
determine  the  effect  and  purpose."  ((Woods  1969>  p.38)  This  Is 
true,  but  It  would  also  be  true  for  any  grammar  complex  enough 
to  handle  all  of  natural  language.  The  simple  examples  of 
transition  netwo'*ks  are  Indeed  easy  to  read  (as  are  simple 
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examples  of  most  grammars)#  but  In  a  network  for  a  complete 
language#  the  purpose  of  £  given  state  would  be  Intimately  bound 
up  with  Its  Interrelation  to  other  states#  and  the  grammar  will 
not  be  as  "perspicuous"  as  we  might  hope.  This  is  just  as  true 
for  programs#  but  no  more  so.  If  we  look  at  the  flow  chart 
Instead  of  the  listing#  programs  are  equally  perspicuous  to 
networks . 

If  the  basic  principles  are  really  the  same#  are  there  any 
differences  at  all  between  Woods'  system  and  ours?  The  ansv/er 
Is  yes#  they  differ  not  In  the  theoretical  power  of  the  parser# 
but  In  the  types  of  analysis  being  carried  out. 

The  most  Important  difference  Is  the  theory  of  grammar 
being  used.  All  of  the  network  systems  are  based  on 
transformational  grammar.  They  try  to  reproduce  the"deep 
structure"  of  a  sentence  while  doing  surface  structure 
recognition.  This  Is  done  by  using  special  commands  to 
explicitly  build  and  rearrange  the  deep  structures  as  'he 
parsing  goes  along.  PROGRAMMAR  Is  oriented  towards  systemic 
grammar#  with  Its  Identification  of  significant  features  In  the 
constituents  being  parsed.  It  therefore  emphasizes  the  ability 
to  examine  the  features  of  constituents  anywhere  on  the  parsing 
tree#  and  to  manipulate  the  feature  descriptions  of  nodes. 

In  section  2.1  we  discussed  the  advantages  of  systemic  grammar 
for  a  language  understanding  system.  Either  type  of  parser 
could  be  adapted  to  any  type  of  grammar#  but  PROGRAMMAR  was 
specially  designed  to  Include  "natural"  operations  for  systemic 
understanding  of  sentences. 

A  second  difference  Is  In  the  Implementation  of  special 
additions  to  the  basic  parser.  For  example  In  section  2.4.2  we 
discussed  the  way  In  which  words  like  "and"  could  be  defined  to 
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act  as  "demons"  which  Interrupt  the  parsing  at  whatever  point 
they  arc  encountered,  and  start  a  special  program  for 
Interpreting  conjoined  structures.  This  has  many  uses,  both  In 
Che  standard  parts  of  the  grammar  (such  as  "and")  and  In 
handling  Idioms  and  unusual  structures.  If  we  think  In  network 
terms,  this  Is  like  having  a  separate  arc  marked  "and"  leading 
from  every  node  In  the  network.  Such  a  feature  could  probably 
be  added  to  the  network  formulation,  but  It  seems  much  more 
natural  to  think  In  terms  of  programs  and  Interrupts. 

A  third  difference  Is  the  backup  mechanism.  The  network 
approach  assumes  some  form  of  nondeterminism.  If  there  are 
several  arcs  leaving  a  node,  there  must  be  some  way  to  try 
folloviing  all  of  them.  Either  we  have  to  carry  forv/ard 
simultaneous  Interpretations,  or  keep  track  of  our  choices  In 
such  a  way  that  the  network  can  automatically  revise  Its  choice 
If  the  original  choice  does  not  lead  to  an  accepting  state. 

This  could  be  done  In  the  program  approach  by  using  a  language 
such  as  PLANNER  with  Its  automatic  backup  mechanisms.  But  In 
section  2.2.7  we  discussed  the  question  of  whether  It  Is  even 
desirable  to  do  so  In  handling  natural  language. 

We  pointed  out  the  advantage  of  an  Intelligent  parser  which 
can  understand  the  reasons  for  Its  failure  at  a  certain  point, 
and  can  guide  Itself  accordingly  Instead  of  backing  up  blindly. 
This  Is  Important  for  efficiency,  and  Woods  Is  very  concerned 
with  ways  to  modify  the  netv/orks  to  avoid  unnecessary  and 
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wasteful  backup  by  "making  the  network  more  de term! r I s t I c . " 
(<Woods  1969>  p.  45).  It  might  be  Interesting  to  explore  a 
compromise  solution  In  which  automatic  backup  facilities 
existed,  but  could  be  turned  on  and  off.  Vie  could  do  this  by 
giving  PROGRAMMAR  special  commands  which  v/ould  cause  It  to 
remember  the  state  of  the  parsing  so  that  later  the  grammar 
could  ask  to  back  up  to  that  state  and  try  something  else.  This 
Is  an  Interesting  area  for  further  work  on  PROGRAMMAR. 

It  Is  difficult  to  compare  the  performance  of  different 
parsers  since  there  Is  no  standard  grammar  or  set  of  test 
sentences.  Bobrow  and  Woods  have  not  published  the  results  of 
any  experiments  with  a  large  grammar,  but  Thorne  has  published 
two  papers  (<Thorne  1968,  1969>)  with  a  number  of  sample 
parsings.  Our  system,  with  Its  current  grammar  of  English  has 
successfully  parsed  all  of  these  examples.  They  took  from  1  to 
5  seconds  apiece.  Some  samples  of  more  complicated  parsings 
Hone  by  the  system  are  given  In  Appendix  B. 


CHAPTER  3 


I nf erence 


3.1  Basfc  Approach  to  MeanJng 
3.1.1  Representing  K.aovy.ledg.e 

We  have  described  the  process  of  understanding  language  as 
a  conversion  from  a  string  of  of  sounds  or  letters  to  an 
Internal  representation  of  "meaning".  In  order  to  do  this,  a 
language-understanding  system  must  have  some  formal  way  to 
express  its  knowledge  of  a  subject,  and  must  be  able  to 
represent  the  "meaning"  of  a  sentence  In  this  formalism.  The 
formalism  must  be  structured  In  such  a  way  that  the  system  can 
use  Its  knowledge  to  make  deductions,  accept  new  Information, 
answer  questions,  and  Interpret  commands.  Choosing  a  form  for 
this  Information  Is  of  central  Importance  to  both  a  practical 
system  and  a  theory  of  semantics. 

First  we  must  decide  what  kinds  of  things  are  to  be 
represented  In  the  formalism.  As  a  beginning,  we  would  like  to 
be  able  to  represent  "objects",  "properties,"  and  "relations." 
Later  we  will  have  to  show  how  these  can  be  combined  to  express 
more  complicated  knowledge.  We  will  describe  ways  to  express 
the  meaning  of  a  wide  variety  of  complex  sentences. 

Using  a  simple  prefix  notation,  we  can  represent  such  facts 
as  "Boise  Is  a  city."  and  "Noah  was  the  father  of  Jafeth."  as: 

(CITY  BOISE)  (FATHER-OF  NOAH  JAFETK) 
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Here,  BOISE,  NOAH,  and  JAFETH  are  rpeclflc  objects,  CITY  Is 
a  property  which  objects  can  have,  and  FATHER-OF  Is  a  relation. 
It  Is  a  practical  convenience  to  list  properties  and  relations 
first,  even  though  this  may  not  follov;  the  natural  English 
order,  so  we  will  do  so  throughout.  Notice  that  properties  are 
In  fact  special  types  of  relations  which  deal  with  only  one 
object.  Properties  and  relations  will  be  dealt  with  In 
identical  ways  throughout  the  system.  In  fact.  It  Is  not  at  all 
obvious  which  concepts  should  be  considered  properties  and  which 
relations.  For  example,  "DeGaulle  Is  old.”  might  be  expressed 
as  (OLD  DEGAULLE)  where  OLD  Is  a  property  of  objects  or  as  (AGE 
DEGAULLE  OLD),  where  AGE  Is  a  relation  betv/een  an  object  and  Its 
age.  In  the  second  expression,  OLD  appears  In  the  position  of 
an  object,  even  though  It  can  hardly  be  construed  as  a 
particular  object  like  BOISE  or  DEGAULLE.  This  suggests  that  we 
might  like  to  let  properties  or  relations  themselves  have 
properties  and  enter  Into  other  relations.  This  has  a  deep 
logical  consequence  which  will  be  discussed  In  later  sections. 

In  order  to  avoid  confusion,  we  will  need  some  conventions 
about  notation.  Most  objects  and  relationships  do  not  have 
simple  English  names,  and  those  that  do  often  share  their  names 
with  a  range  of  ocher  meanings.  The  house  on  the  corner  by  the 
market  doesn't  have  a  proper  name  like  Jafeth,  even  though  It  Is 
just  as  much  a  unique  object.  For  the  Internal  use  of  the 
system,  we  will  give  It  a  unique  name  by  stringing  together  a 
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descriptive  word  and  an  arbitrary  number,  then  prefixing  the 
result  v/ich  a  cclon  to  remind  us  It  Is  an  object.  The  house 
mentioned  above  might  be  called  :H0USE374.  Properties  and 
relations  must  also  go  under  an  assumed  name,  since  (FLAT  X)  , 
might  mean  very  different  things  depending  on  whether  X  Is  a 
tire  or  a  musical  note.  We  can  do  the  same  thing  (using  a 
different  punctuation  mark,  #>  to  represent  these  two  meanings 
as  #FLAT1  and  #FLAT2.  When  the  meaning  Intended  Is  clear/  we 
will  omit  the  numbers,  but  leave  the  punctuation  marks  to  remind 
us  that  It  's  a  property  or  relation  rather  than  a  specific 
object.  Thus,  our  facts  listed  above  should  be  v/ritten; 

(#CITY  ;B0ISE)  (#FATMEP-0F  :N0AH  tJAFETH),  and  either 
(#0LD  sDEGAULLE)  or  (#AGE  zDEGAULLE  #0LD) . 
vie  are  letting  properties  serve  In  a  dual  function  --  we 
can  use  them  to  say  things  about  objects  (as  In  "The  sky  Is 
blue."  --  (#BLUE  ;SKY))  or  we  can  say  things  about  them  as  If 
they  were  objects  (as  In  "Blue  Is  a  color."  --  (fCOLOR  #BLUE)). 
Vie  want  to  extend  this  even  further,  and  allow  entire 
relationships  to  enter  Into  other  relationships.  (Vie 
distinguish  between  "relation",  the  abstract  symbol  such  as 
#FATHER-0F,  and  "relationship",  a  particular  Instance  such  as 
(#FATHER-0F  :N0AH  zJAFETH)).  In  accord  with  our  earlier 
convention  about  naming  things,  we  can  give  the  relationship  a 
name,  so  that  we  can  treat  It  like  an  object  and  say  (#KM0W  :l 
;REL76)  where  :REL76  Is  a  name  for  a  particular  relationship 
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like  (#FATHER-OF  :N0AH  rJAFETH).  We  can  keep  straight  which 
name  goes  with  which  relationship  by  putting  the  name  directly 
into  the  relationship.  Our  example  would  then  become  (#FATHER- 
OF  :N0AH  tJAFETH  :REL76).  There  Is  no  special  reason  to  put 
the  name  last,  except  that  It  makes  Indexing  and  reading  the 
statements  easier.  We  can  tell  that  ;REL76  Is  the  name  of  this 
relation,  and  not  a  participant  since  FATKER-OF  relates  only  two 
objects.  Similarly,  we  knew  that  It  has  to  be  a  participant  In 
the  relationship  (#KM0W  ;1  ;REL76)  since  #KN0W  needs  two 
arguments . 

V/e  now  have  a  system  which  can  be  used  to  describe  more 
complicated  facts.  "Harry  slept  on  the  porch  after  he  gave 
Alice  the  jewels."  v/ould  become  a  set  of  assertions; 

(#SLEEP  -.HARRY  :REL1)  (#L0CATI0N  :REL1  ;P0RCH) 

(#GIVE  ;HARRY  ;ALICE  :JEWELS  :REL2)  (#AFTER  :REL1  :REL2) 

This  example  points  out  several  facts  abut  the  notation. 

The  number  of  participants  in  a  relationship  depends  on  the 
particular  relation,  and  can  vary  from  0  to  any  number.  We  do 
not  need  to  give  every  relationship  a  name  --  It  Is  present  only 
if  we  want  to  be  able  to  refer  to  that  relationship  elsewhere. 
This  will  often  be  done  for  events,  which  are  a  type  of 
relationship  with  special  properties  (such  as  time  and  place  of 
occurrence) . 
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3.1.2  Phi losophlcal  Cons  I  derat  I ons 

Before  going  on,  let  us  stop  and  ask  what  v.e  are  doing.  In 
the  preceding  paragraphs,  v/e  have  developed  a  notation  for 
representing  certain  kinds  of  meaning.  In  doing  so  we  have 
glibly  passed  over  Issues  which  have  troubled  phllsophers  and 
linguists  for  thousands  of  years.  Countless  treatises  and 
debates  have  tried  to  analyze  just  what  It  means  to  be  an 
"object"  or  a  "property",  and  what  logical  status  a  symbol  such 
as  #BLUE  or  #CITY  should  have.  V'le  will  not  attempt  to  give  a 
philosophical  answer  to  these  questions,  but  Instead  take  a  more 
pragmatic  approach  to  meaning. 

Language  Is  a  process  of  communication  between  people,  and 
Is  Inextricably  enmeshed  in  the  knov/ledge  that  those  people  have 
about  the  world.  That  knowledge  Is  not  a  neat  collection  of 
definitions  and  axioms,  complete,  concise  and  consistent. 

Rather  It  is  a  collection  of  concepts  designed  to  manipulate 
ideas.  It  Is  In  fact  Incomplete,  highly  redundant,  and  often 
Inconsistent.  There  Is  no  self-contained  set  of  "primitives" 
from  which  everything  else  can  be  defined.  Definitions  are 
circular,  with  the  meaning  of  each  concept  depending  on  the 
other  concepts. 

This  might  seem  like  a  meaningless  change  --  saying  that  the 
meaning  of  words  Is  represented  by  the  equally  mysterious 
meanings  of  "concepts"  which  exist  In  the  speaker's  and  hearer's 
minds,  but  which  are  open  to  neither  Immediate  Introspection  nor 
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experiment.  However#  there  Is  a  major  difference.  The  structure 
of  concepts  which  Is  postulated  can  be  manipulated  by  a  logical 
system  v/lthin  the  computer.  The  "Internal  representation"  of  a 
sentence  Is  something  which  the  system  can  obey#  answer#  or  add 
to  Its  knowledge.  It  can  relate  a  sentence  to  other  concepts# 
draw  conclusions  from  It#  or  store  It  In  a  way  which  makes  It 
useable  In  further  deductions  and  analysis. 

This  can  be  compared  to  the  use  of  "forces"  In  physics.  We 
have  no  way  of  directly  observing  a  force  like  gravity#  but  by 
postulating  Its  existence#  we  can  write  equations  describing  It# 
and  relate  these  equations  to  the  physical  events  Involved. 
Similarly#  the  "concept"  representation  of  moaning  Is  not 
intended  as  a  direct  picture  of  something  which  exists  In  a 
person's  mind.  It  Is  a  fiction  of  the  scientist#  valid  only  In 
that  it  gives  him  a  way  to  make  sense  of  data#  and  predict 
actual  behavior. 

The  justification  for  our  use  of  concepts  In  this  system 
is  the  way  it  actually  carries  out  a  dialog  which  simulates  In 
many  ways  the  behavior  of  a  human  language  user.  For  a  wider 
field  of  discourse#  It  would  have  to  be  expanded  In  its  details# 
and  perhaps  In  some  aspects  of  its  overall  structure.  However 
the  Idea  is  the  same  --  that  we  can  In  fact  gain  a  better 
understanding  of  language  use  bv  postulating  these  fictitious 
concepts  and  structures#  and  analyzing  the  ways  in  which  they 
interact  with  language. 

The  success  of  such  a  theory  at  actualy  describing  language 
will  depend  largely  on  the  power  and  flexibility  of  the 
representation  used  for  the  concepts.  Later  sections  of  this 
chapter  discuss  the  reasons  why  PLANNER  Is  particularly  well 
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suited  for  this  job. 

We  v/ould  like  to  consider  some  concepts  as  "atomic'*.  (i.e. 
concepts  which  are  considered  to  have  their  own  meaning  rather 
than  being  just  combinations  of  other  more  basic  concepts).  A 
property  or  relation  is  atomic  not  because  of  some  special 
logical  status#  but  because  it  serves  a  useful  purpose  in 
relation  to  the  other  concepts  in  the  speaker's  model  o*  the 
world.  For  example#  the  concept  #OLD  is  surely  not  primitive# 
since  it  can  be  defined  in  terms  of  #AGE  and  number.  However# 
as  an  atomic  property  it  will  often  appear  in  knowledge  about 
people#  the  way  they  look#  the  v;ay  they  act#  etc.  Indeed#  we 
could  omit  it  and  always  express  something  like  "having  an  age 
greater  than  30"#  but  our  model  of  the  world  will  be  simpler  and 
more  useful  if  we  have  the  concept  #0LD  available  instead. 

There  is  no  sharp  line  dividing  atomic  concepts  from  non- 
atomic  ones.  It  would  be  absurd  to  have  separate  atomic 
concepts  for  such  things  as  #C I TY-OF-POPULAT ! OH-23# 485  or 
#PERS0N-WEIGHIMG-BETWEEN-178-AND-181.  But  it  might  in  fact  be 
useful  to  distinguish  between  #BIG-C1TY#  #T0WN#  end  #VILLAGE#  or 
between  #FAT#  and  #THIN#  since  our  model  may  often  use  these 
distinct! ons . 

If  our  "atomic"  concepts  are  not  logically  primitive#  what 
kind  of  status  do  they  have?  V^/hat  Is  their  "meaning"?  How  are 
they  defined?  The  answer  Is  again  relative  to  the  world-model 
of  the  speaker.  Facts  cannot  be  classified  as  "those  which 
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define  a  concept*'  and  "those  which  describe  It."  Ask  someone  to 
define  ^PERSON  or  #JUSTICE/  and  he  will  come  up  with  a  formula 
or  slogan  which  Is  very  limited.  ^JUSTICE  Is  defined  In  his 
world-model  by  a  series  of  examples,  experiences,  and  specific 
cases.  The  model  Is  circular,  with  the  meaning  of  any  concept 
depending  on  the  entire  knowledge  of  the  speaker,  (not  just  the 
kind  which  would  be  Included  In  a  dictionary).  There  must  be  a 
close  similarity  between  the  models  held  by  the  speaker  and 
listener,  or  there  could  be  no  communication.  If  my  concept  of 
^DEMOCRACY  and  yours  do  not  coincide,  we  may  nave  great 
difficulty  understanding  each  other’s  political  viewpoints. 
Fortunately,  on  simpler  things  such  as  #BLUE,  #nOR,  and  #AFTER, 
there  Is  a  pretty  good  chance  that  the  models  will  be 
practically  Identical.  In  fact,  for  simple  concepts,  we  can 
choose  a  few  primary  facts  about  the  concept  and  use  them  as  a 
"definition"  ,  which  corresponds  to  the  traditional  dictionary. 

Returning  to  our  notation,  we  see  that  It  Is  Intentionally 
general,  so  that  out  system  can  deal  with  concepts  as  people  do. 
In  English  we  can  treat  events  and  relationships  as  objects,  as 
In  "The  war  destroyed  Johnson's  rapport  with  the  people."  Within 
our  representation  of  meaning  we  can  similarly  treat  an  event 
such  as  #WAR  or  a  relationship  of  #RAPPORT  In  the  same  way  we 
treat  objects.  Vie  do  not  draw  a  sharp  philosophical  distinction 
between  "specific  objects",  "properties",  relationships", 
"events",  etc. 
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Complex  Information 

We  now  have  a  way  to  store  a  data  base  of  assertions  about 
particular  objects,  properties,  and  relationships.  Next,  we 
want  to  handle  more  complex  Information,  such  as  "All  canaries 
are  yellow.",  or  "A  thesis  Is  acceptable  If  either  It  Is  long  or 
It  contains  a  persuasive  argument."  This  could  be  done  using 
a  formal  language  such  as  the  predicate  calculus.  Baste  logical 
relations  such  as  Implies,  or,  and,  there-exists,  etc.  are 
represented  symbolically,  and  Information  Is  translated  Into  a 
"formula".  Thus  we  might  have: 

(FORALL  (X)  <IMPLIES(#CANARY  X)(#C0L0R  X  #YELL0W))> 

(FORALL  (XXIMPLIES 

(AND  (#THESIS  X) 

(OR  (#LONG  X) 

(EXISTS  (Y) 

(AND  (#PERSUASIVE  Y) 

(♦ARGUMENT  Y) 

(♦CONTAINS  X  Y))))) 

(♦ACCEPTABLE  X))) 

Figure  52  --  Predicate  Calculus  Representation 


Several  notatlonal  conventions  are  used.  First,  we  need 
variables  so  that  we  can  say  things  about  objects  without  naming 
particular  ones.  This  Is  done  with  the  quantifiers  FORALL  and 
EXISTS.  Second,  we  need  logical  relations  like  AND,  OR,  NOT, 
and  IMPLIES.  Using  this  formalism,  we  can  represent  a  question 
as  a  formula  to  be  "proved".  To  ask  "Is  Sam's  thesis 
acceptable?"  we  could  give  the  formula  (fACCEPTABLE  :SAM-THESIS) 
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to  a  theorem  prover  to  prove  by  manipulating  the  formulas  and 
assertions  In  the  data  base  according  to  the  rules  of  logic.  We 
would  need  some  additional  theorems  which  would  allow  the 
theorem  prover  to  prove  that  a  thesis  Is  long/  that  an  argument 
Is  acceptable/  etc. 

In  some  theoretical  sense/  predicate  calculus  formulas 
could  express  all  of  our  knowledge/  but  In  a  practical  sense 
there  Is  something  missing.  A  person  would  also  have  knowledge 
about  how  to  go  about  doing  the  deduction.  He  would  know  that 
he  should  check  the  length  of  the  thesis  first/  since  he  might 
be  able  to  save  himself  the  bother  of  reading  It/  and  that  he 
might  even  be  able  to  avoid  counting  the  pages  If  there  Is  a 
table  of  contents.  In  addition  to  complex  Information  about 
what  must  be  deduced/  he  also  knows  a  lot  of  hints  and 
"heuristics"  telling  how  to  do  It  better  for  the  particular 
subject  being  discussed. 

Most  "theorem-proving"  systems  do  not  have  any  way  to 
Include  this  additional  Intelligence.  Instead/  they  are  limited 
to  a  kind  of  "working  In  the  dark".  A  uniform  proof  procedure 
gropes  Its  way  through  the  collection  of  theorems  and 
assertions/  according  to  some  general  procedure  which  does  not 
depend  on  the  subject  matter.  It  tries  to  combine  any  facts 
which  might  be  relevant/  working  from  the  bottom  up.  In  our 
example  given  above/  we  might  have  a  very  complex  theorem  for 
deciding  whether  an  argument  Is  persuasive.  A  uniform  proof 
procedure  might  spend  a  great  deal  of  time  checking  the 
persuasiveness  of  every  argument  It  knew  about/  sines  a  clause 
of  the  form  (PERSUASIVE  X)  might  be  relevant  to  the  proof.  What 
we  would  prefer  Is  a  way  for  a  theorem  to  guide  the  process  of 
deduction  In  an  Intelligent  way.  Carl  Hewitt  has  worked  with 
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this  problem  and  has  developed  a  theorem-proving  language  called 
PLANNER  <HewItt  1968,  196S>.  In  PLANNER,  theorems  are  In  the 
form  of  programs,  whic.'i  describe  how  to  go  about  proving  a  goal, 
or  how  to  deduce  consequences  from  an  assertion.  This  Is 
described  at  length  In  section  3.3,  and  forms  the  basis  for  the 
Inference  part  of  our  English  understander.  In  PLANNER,  our 
sentence  about  thesis  evaluation  could  be  represented  as  shown 
In  Figure  53. 

This  Is  similar  In  structure  to  the  predicate  calculus 
representation  given  above,  but  there  are  Important  differences. 
The  theorem  Is  a  program,  where  each  logical  operator  indicates 
a  definite  series  of  steps  to  be  carried  out.  TH60AL  says  to 
try  to  find  an  assertion  In  the  data  base,  or  to  prove  It  using 
other  theorems.  THUSE  gives  advice  on  what  othc'  theorems  to 
use,  and  In  what  order.  THAND  and  THOR  are  equivalent  to  the 
logical  AND  and  OR  except  that  they  give  a  specific  order  In 
which  things  should  be  tried.  (The  "lisping"  Is  to 
differentiate  PLANNER  names  from  the  standard  LISP  functions  AND 
and  OR.  This  same  convention  Is  used  for  all  functions  which 
have  LISP  analogs.) 

The  theorem  EVALUATE  says  that  If  we  ever  want  to  prove 
that  a  thesis  Is  acceptable,  we  should  first  make  sure  It  Is  a 
thesis  by  looking  In  the  data  base.  Next,  we  should  try  to 
prove  that  It  Is  long,  first  by  using  Che  theorem  CONTENTS-CHECK 
(which  would  check  the  table  of  contents),  and  If  that  falls,  by 
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(DEFINE  THEOREM  EVALUATE 

; EVALUATE  Is  the  name  we  ere 
;glvlng  to  the  theorem 

(THCONSECX  Y) 

;thls  Indicates  the  type  of 
; theorem  and  names  Its 
;var tables 

(THGOALC#THESIS  $?X)) 

;show  that  X  Is  a  thesis 
;the  "$?"  Indicates  a  variable 

(THOR 

;THOR  Is  like  "or",  trying  things 
;ln  the  order  given  until  one  works 

(THG0AL(#L0NG  $?X)<THUSE  CONTENTS -CHECK  COUNTPAGES)) 

;THUSE  says  to  try  the  theorem 
;named  CONTEMTS-CHECK  first, 

;then  If  that  doesn't  work,  try 
; the  one  named  COUNTPAGES 


(THAND 


;THAND  Is  like  "and" 


(THGOALdCONTAINS  $?X  $?Y)) 

;flnd  something  Y  which  Is 
;contalned  In  X 

(THGOAL(#ARGUMENT  $?Y)) 

;show  that  It  Is  an  argument 


(THGOAL(#PERSUASIVE  $?Y)(THTBF  THTRUE)))))) 

;prove  that  It  Is  persuasive,  using 
;any  theorems  which  are  applicable 


Figure  53 


ANNER  Representation 
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using  a  theorem  named  CO'iNTPAGES  (which  might  In  fact  call  a 
simple  LISP  program  which  thumbs  through  the  paper.)  If  they 
both  fall,  then  we  look  In  the  data  base  for  something  contained 
In  the  thesis,  check  that  It  Is  an  argument,  and  then  finally 
try  to  prove  that  It  Is  persuasive.  Here,  we  have  used  (THTBF 
THTRUE),  which  Is  PLANNER'S  way  of  saying  "try  anything  you  know 
which  can  help  prove  It".  PLANNER  must  then  go  searching 
through  all  of  Its  theorems  on  persuas I vensess.  Just  as  any 
other  theorem  prover  would.  Thera  are  two  Important  changes, 
though.  First,  we  never  need  to  look  at  persuasiveness  at  all 
If  we  are  able  to  determine  that  the  thesis  Is  long.  Second,  we 
only  look  at  the  persuasiveness  of  arguments  which  we  already 
know  are  a  part  of  the  thesis.  We  do  not  get  sidetracked  Into 
looking  at  the  persuasiveness  theorems  except  for  the  cases  we 
real  1 y  want . 

PLANNER  also  does  a  number  of  other  things,  like 
maintaining  a  dynamic  data  base  (assertions  can  be  added  or 
removed  to  reflect  the  way  tne  world  changes  In  the  course  of 
time),  allowing  us  to  control  how  much  deduction  will  be  done 
when  new  facts  are  added  to  the  data  base,  etc.  These  are  all 
discussed  In  section  3.3. 
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3.1.4  Ques  t  Ions.  StatemePts.  and  Commr.nds 

PLANNER  Is  particularly  convenient  for  a  language- 
understanding  system,  since  It  can  express  statements,  commands, 
and  questions  directly.  We  have  already  shown  how  assertions 
can  be  stated  In  simple  PLANNER  format.  Commands  and  questions 
are  also  easily  expressed.  Since  a  theorem  Is  written  In  the 
form  of 'a  procedure,  we  can  let  steps  of  that  procedure  actually 
be  actions  to  be  taken  by  a  robot.  The  command  "Pick  up  the 
block  and  put  It  In  the  box."  could  be  expressed  as  a  PLANNER 
program: 

(THAND(THGOAL<#PICKUP  :BLOCK23)) 

(THGOAL(#PUTIN  ;BLOCK23  :B0X7))) 

Remember  that  the  prefix  and  the  number  Indicate  a  specific 
object.  The  theorems  for  #PICKUP  and  #PUTIN  would  also  be 
programs,  describing  the  sequence  of  steps  to  be  done. 

Earlier  we  asked  about  Sam's  thesis  In  predicate  calculus 
In  PLANNER  we  can  ask: 

(THGOAL  (#ACCEPTABLE  :SAM-THES IS) (THUSE  EVALUATE)) 

Here  we  have  specified  that  our  theorem  EVALUATE  Is  to  be 
used.  If  we  evaluated  this  PLANNER  statement,  the  theorem  would 
be  called,  and  executed  Just  as  described  on  the  previous  pages. 
PLANNER  would  return  one  of  the  values  ”T"  or  "NIL"  depending  on 
whether  the  statement  Is  true  or  false. 

For  a  question  like  "Vlhat  nations  have  never  fought  a  war?" 
PLANNER  has  the  function  THFIND.  We  would  ask; 
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(THFIND  ALL  $?X  (X  Y) 

(THG0AL(#NATI0N  $?X)> 

(THNOT 

(THAND(THGOAL<#WAR  $?Y)) 

(THGOAL(#PARTICIPATED  $?X  $?Y))))) 

and  PLANNER  would  return  a  list  of  all  such  countries.  Using 

our  conventions  for  giving  names  to  relations  and  events,  we 

could  even  ask: 

(THFIND  ALL  $?X  (X  Y  Z  EVENT) 

(THGOAL(#CHICKEN  $?Y)) 
aii60AL(#R0AD  $?Z)) 

(THG0AL(#CR0SS  $?Y  $?Z  $?£VENT)) 

(THGOAL(#CAUSE  $?X  $7EVENT))) 

This  brief  description  has  explained  the  basic  concepts 
underlying  the  deductive  part  of  our  language  understanding 
program.  To  go  with  It,  we  need  a  complex  model  of  the  subject 
being  discussed.  This  Is  described  In  section  3.4.  Section  3.3 
gives  more  details  about  the  PLANNER  lanr.uage  and  Its  uses. 
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3.2  Compart  son  with  Previous  Programs 
In  Section  3.1  we  discussed  ways  of  representing 
Information  and  meaning  within  a  language-compreherid i ng  system. 
In  order  to  compare  our  Ideas  with  those  In  previous  systems,  we 
will  estaollsh  a  broad  classification  of  the  field.  Of  course, 
no  set  of  pigeon-holes  can  completely  characterize  the 
differences  between  programs,  but  they  can  give  us  some 
viewpoints  from  which  to  analyze  different  people's  work,  and 
can  help  us  see  past  the  superficial  differences.  Vie  will  deal 
only  with  the  ways  that  programs  represent  their  Information 
about  the  subject  matter  they  dlscjss.  Issues  such  as  parsing 
and  semantic  analysis  techniques  are  discussed  In  other 
sections.  We  will  distinguish  four  basic  types  of  systems 
called  "special  format",  "text  based",  "restricted  logic",  and 
"general  deductive". 


Section  3.2.1  -  Page  222 


3.2.1  Special  Format  Systems 

Most  of  the  early  language  undestandlng  programs  were  of 
the  special  format  type.  Such  systems  usually  use  two  special 
formats  designed  for  their  particular  subject  matter  one  for 
representing  the  knowledge  they  keep  stored  away#  and  the  other 
for  the  meaning  of  the  English  Input.  Soma  examples  are: 
BASEBALL  <P.F.Green>/  which  stored  tables  of  baseball  results 
and  Interpreted  questions  as  "specification  lists"  requesting 
data  from  those  tables;  SAD  SAM  <LIndsay>,  which  Interpreted 
sentences  as  simple  relationship  fact.s  about  people/  and  stored 
these  In  a  network  structure;  STUDENT  <Bobrow  1964>/  which 
Interpreted  sentences  as  linear  equations  and  could  store  other 
linear  equations  and  manipulate  them  to  solve  algebra  problems; 
and  ELIZA  <Welzenbaum  1966>/  whose  Internal  knowledge  Is  a  set 
of  sentence  rearrangements  and  key  words,  and  which  sees  Input 
as  a  simple  string  of  words. 

These  programs  all  make  the  assumption  that  the  only 
relevant  Information  In  a  sentence  Is  that  which  fits  their 
particular  format.  Although  they  may  have  very  sophisticated 
mechanisms  For  using  this  Information  (as  in  CARPS  <CharnIak>/ 
which  can  solve  word  problems  In  calculus),  they  are  each  built 
for  a  special  purpose,  and  do  not  handle  Information  with  the 
flexibility  which  would  allow  them  to  be  adapted  to  other  uses. 
Nevertheless,  their  restricted  domain  often  allows  them  to  use 
very  clever  tricks,  which  achieve  Impressive  results  with  a 
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minimum  of  concern  for  the  complexities  of  language. 
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2 . 2  Text  pg.sgsl  lyg,t,ef9?. 

Some  researchers  were  not  satisfied  with  the  limitations 
Inherent  In  the  special-format  approach.  They  wanted  systems 
which  were  not  limited  by  their  construction  to  a  particular 
specialized  field.  Instead  they  used  English  text/  with  all  of 
Its  generality  and  diversity/  as  a  basis  for  storing 
Information.  In  these  “text  based*'  systems/  a  body  of  text  Is 
stored  directly/  under  some  sort  of  Indexing  scheme.  An  English 
sentence  Input  to  the  understander  Is  Interpreted  as  a  request 
to  retrieve  a  relevant  sentence  or  group  of  sentences  from  the 
text.  Various  Ingenious  methods  were  used  to  find  possibly 
relevant  sentences  and  decide  which  were  most  likely  to  sastisfy 
the  request. 

PROTOSYNTHEX  I  <Stmmons  ig66>  had  an  Index  specifying  all 
the  places  where  each  “content  word"  was  found  In  the  text.  It 
tried  to  find  the  sentences  which  had  the  most  words  In  common 
with  the  request  (using  a  special  weighting  formula)/  then  did 
some  syntactic  analysis  to  see  whether  the  words  In  common  were 
In  the  right  grammatical  relationship  to  each  other.  Semantic 
Memory  <CIuIIIIan  1966>  stored  a  slightly  processed  version  of 
English  dictionary  definitions  In  which  multiple-meaning  words 
were  eliminated  by  having  humans  Indicate  the  correct 
Interpretation.  It  then  used  an  associative  Indexing  scheme 
which  enabled  the  system  to  follow  a  chain  of  Index  references. 
An  input  request  was  In  the  form  of  two  words  Instead  of  a 
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sentence.  The  response  was  the  shortest  chain  which  connected 
them  through  the  associative  Index  (e.g.  If  there  Is  a 
definition  containing  the  words  A  and  B  and  one  containing  B  and 
C/  a  request  to  relate  A  and  C  will  return  both  sentences). 

Even  with  complex  Indexing  schemes/  the  text  based  approach 
has  a  basic  problem.  It  can  only  spout  back  specific  sentences 
which  have  been  stored  away/  and  can  not  answer  any  question 
which  demands  that  something  be  deduced  from  more  than  one  piece 
of  Information.  In  addition/  Its  responses  often  depend  on  the 
exact  way  the  text  and  questions  are  stated  In  English/  rather 
than  dealing  with  the  underlying  meaning. 


j. 


1 
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3.2.3  Limited  Logic  lys tfima 

The  "limited  logic"  approach  attempted  to  correct  these 
faults  of  text  based  systems/  and  has  been  used  for  most  of  the 
more  recent  language  understanding  programs.  First/  some  sort 
of  mo''e  formal  notation  Is  substituted  for  the  actual  English 
sentences  In  the  base  of  stored  knoledge.  This  notation  may 
take  many  different  formS/  such  as  "desrlptlon  lists"  <Raphael 
1964>/  "kernels"  <Slmmons  1968>  "concept-relatlon-concept 
triples"  <Slmmons  1969>/  "data  nodes"  <QuI11lan  1969>  *  "rings" 
<Thompson>/  "relational  operators"  <Tharp>  /  etc.  Each  of  these 
forms  Is  designed  for  efficient  use  In  a  particular  system/  but 
at  heart  they  are  all  doing  the  same  thing  --  providing  a 
notation  for  simple  assertions  of  the  sort  described  In  section 
3.1.1,  It  Is  relatively  unimportant  which  special  form  Is 
chosen.  All  of  the  different  methods  can  provide  a  uniform 
formalism  which  frees  simple  information  from  being  tied  down  to 
a  specific  way  of  expressing  It  in  English.  Once  this  Is  done/ 
a  system  must  have  a  way  of  translating  from  the  English  Input 
sentences  Into  this  Internal  assertion  format/  and  the  greatest 
bulk  of  the  effort  In  language  understanding  systems  has  been 
this  "semantic  analysis".  We  will  discuss  It  at  length  In 
chapter  4.  For  now  we  are  mere  Interested  In  what  can  be  done 
with  the  assertions  once  they  have  been  put  Into  the  desired 
form. 

Some  systems  (see  <QulI1lan  1969>/  <Tharp>)  remain  close  to 
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text  based  systems^  only  partially  breaking  down  the  Initial 
text  Input.  The  text  Is  processed  by  some  sort  of  dependency 
analysts  and  left  In  a  network  form,  either  emphasizing  semantic 
relationships  or  remaining  closer  to  the  syntactic  dependency 
ana  1 ys I s .  What  Is  common  to  these  systems  Is  that  they  do  not 
attempt  to  answer  questions  from  the  stored  Information.  As 
with  text  based  systems,  they  try  to  answer  by  giving  back  bits 
of  Information  directly  from  the  data  base.  They  may  have 
clever  ways  to  decide  what  parts  of  the  data  are  relevant  to  a 
request,  but  they  do  not  try  to  break  the  question  down  and 
answer  It  by  logical  inference.  Because  of  this,  they  suffer 
the  same  deficiencies  as  text  based  systems.  They  have  a  mass 
of  Information  stored  away,  but  little  way  to  use  It  except  to 
print  I t  back  out. 

Most  of  the  systems  which  have  been  developed  recently  fit 
more  comfortably  under  the  classification  "limited  logic".  In 
addition  to  their  data  base  of  assertions  (whatever  they  are 
called),  they  have  some  mechanism  for  accepting  more  complex 
Information,  and  using  It  to  deduce  the  answers  to  more  complex 
questions.  By  "complex  Information"  we  mean  the  type  of 
knowledge  described  In  section  3.1.3.  This  Includes  knowledge 
containing  logical  quantifiers  and  relationships  (such  as  "Every 
canary  Is  either  yellow  or  purple,"  or  "If  A  Is  a  part  of  B  and 
B  Is  a  part  of  C,  then  A  Is  a  part  of  C.").  By  "complex 
questions",  we  mean  questions  which  are  not  wnswerable  by  giving 
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out  one  of  the  data  base  assertions#  but  demand  some  logical 
Inference  to  produce  an  answer. 

One  cf  the  earliest  limited  logic  programs  was  SIR  <Raphael 
1964>#  which  could  answer  questions  using  simple  logical 
relations  (like  the  ’’part"  example  In  the  previous  paragraph). 
The  complex  Information  was  not  expressed  as  data#  but  was  built 
directly  Into  the  SIR  operating  program.  This  meant  that  the 
types  of  complex  Information  It  could  use  were  highly  limited# 
and  could  not  be  easily  changed  or  expanded.  The  complex 
questions  It  could  answer  were  similar  to  those  In  many  later 
limited  logic  systems#  consisting  of  four  basic  types.  The 
simplest  Is  a  question  which  translates  Into  a  single  assertion 
to  be  verified  or  falsified  (e.g.  "Is  John  a  bagel?")  The  second 
Is  an  assertion  In  which  one  part  Is  left  undetermined  (e.g. 

"Who  Is  a  bagel?")  and  the  system  responds  by  "filling  in  the 
blank".  The  third  type  Is  an  extension  of  this#  which  asks  for 
all  possible  blank-fillers  (e.g.  "Name  all  bagels.")#  and  the 
fourth  adds  counting  to  this  listing  facility  to  answer  count 
questions  (e.g. "How  many  bagels  are  there?").  SIR  had  special 
logic  for  answering  "how  many"  questions#  using  Information  like 
"A  hand  has  5  fingers."#  and  In  a  similar  way  each  limited  logic 
system  had  special  bullt-ln  mechanisms  to  answer  certain  types 
of  questions. 

The  DEACON  system  <Thompson>  had  special  "verb  tables"  to 
handle  time  questions#  and  a  bottom-up  analysis  method  which 


( 
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allowed  questions  to  be  nested.  For  example,  the  question  "Who 
Is  the  commander  of  the  batalllon  at  Fort  Fubar?"  was  handled  by 
first  Internally  answering  the  question  "VJhat  batalllon  is  at 
Fort  Fubar?"  The  answer  was  Chen  substituted  directly  Into  the 
original  question  to  make  It  "Who  Is  the  commander  of  the  69th 
batalllon?",  which  the  system  then  answered.  PROTOSYNTHEX  II 
<Slmmons  1968>  had  special  logic  for  taking  advantage  of  the 
transitivity  of  "Is"  (e.g.  "A  boy  Is  a  person,",  "A  person  Is  an 
animal."  therefore  "A...").  PROTOSYNTHEX  III  <Slmmons  1969>  and 
SAMENLAQ  II  <Shaplro>  bootstrapped  their  way  out  of  first-order 
logic  by  allowing  simple  assertions  about  relationships  (e.g. 
"North-of  Is  the  converse  of  South-of.").  CONVERSE  <Kellogg> 
converted  questions  Into  a  "query  language"  which  allowed  the 
form  of  the  question  to  be  more  complex  but  used  simple  table 
lookup  for  finding  the  answers., 

All  of  the  limited  logic  systems  are  basically  similar.  In 
that  complex  Information  Is  not  part  of  the  data,  but  Is  built 
Into  the  system  programs.  Those  systems  which  could  add  to 
their  Initial  data  base  by  accepting  English  sentences  could 
accept  only  simple  assertions  as  Input.  The  questions  could  not 
Involve  complex  quantified  relationships  (e.g.  "Is  there  a 
country  which  Is  smaller  than  every  U.S.  state?). 
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3.2.4  fianSJlfl).  Deductive  §Y5-lgJ!!,i 

The  pfoblems  of  limited  logic  systems  were  recognized  very 
early  (see  <Raphael  1964>  p.  90),  and  neople  looked  for  a  more 
general  approach  to  storing  and  using  complex  Information.  If 
the  knowledge  could  be  expressed  In  some  standard  mathematical 
notation  (such  as  the  predicate  calculus),  then  all  of  the  work 
logicians  have  done  on  theorem  proving  could  be  utilized  to  make 
an  efficient  deductive  system.  By  expressing  a  question  as  a 
theorem  to  be  proved  (see  section  3,1.3),  the  theorem  prover 
could  actually  deduce  the  Information  needed  to  answer  any 
question  which  could  be  expressed  In  the  formalism.  Complex 
Information  not  easily  useable  In  limited  logic  systems  could  be 
neatly  expressed  In  the  predicate  calculus,  and  a  body  of  work 
already  existed  on  computer  theorem  proving.  This  led  to  the 
"general  deductive"  approach  to  language  understanding  programs. 

The  early  programs  used  logical  systems  less  powerful  than 
the  full  predicate  calculus  (see  <Bar-HI 1 lel>,  <Coles  1968>,  and 
<0arl lngton>}  but  the  big  boost  to  theorem  proving  research  was 
the  development  of  the  Robinson  resolution  algorithm  <Roblnson>, 
a  very  simple  "complete  uniform  proof  procedure"  for  the  first 
oroer  predicate  calculus.  This  meant  that  It  became  easy  to 
write  an  automatic  theorem  proving  program  with  two  Important 
characteristics.  First,  the  procedure  Is  "uniform"  -•  we  need 
not  (and  In  fact,  cannot)  tell  It  how  to  go  about  proving  things 
In  a  way  suited  to  particular  subject  matter.  It  has  Its  own 
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fixed  procedure  for  building  proofs^  and  we  can  only  change  the 
sets  of  logical  statements  (or  "axioms*')  for  It  to  work  on. 
Second^  It  guarantees  that  If  any  proof  Is  possible  using  the 
rules  of  predicate  calculus,  the  procedure  will  eventually  find 
It  (even  though  It  may  take  a  very  long  time).  These  are  very 
pretty  properties  for  an  abstract  deductive  system,  but  the 
question  we  must  ask  is  whether  their  theoretical  beauty  Is 
worth  paying  the  price  of  low  practicality.  We  would  like  to 
argue  that  In  fact  they  have  led  to  the  worst  deficiencies  of 
the  theorem-proving  question-answerers,  and  that  a  very 
different  approach  Is  called  for. 

The  "uniform  procedure"  approach  was  adopted  by  a  number  of 
systems  (see  <Green  1968,  196S>)  as  an  alternative  to  the  kind 
uf  specialized  limited  logic  discussed  In  the  previous  section. 
It  was  felt  that  there  must  be  a  way  to  present  complex 
Information  as  data  rather  than  embedding  It  Into  the  Inner 
workings  of  the  language  understanding  system.  There  are  many 
benefits  In  having  a  uniform  notation  for  representing  problems 
and  knowledge  In  a  way  which  does  not  depend  on  the  quirks  of 
the  particular  program  which  will  Interpret  them.  It  en<i'.bles  a 
user  to  describe  a  body  of  knowledge  to  the  computer  In  a 
"neutral"  way  without  knowing  the  details  of  the  question¬ 
answering  system,  and  guarantees  that  the  system  will  be 
applicable  to  any  subject,  rather  than  being  specialized  to 
handle  only  one. 
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Predicate  calculus  seemed  to  be  a  good  uniform  notation, 
but  In  fact  It  has  a  serious  deficiency.  By  putting  complex 
Information  Into  a  "neutral"  logical  formula,  these  systems 
Ignored  the  fact  that  an  Important  part  of  a  person's  knowledge 
concerns  how  to  go  about  figuring  things  out.  Our  heads  don't 
contain  neat  sets  of  logical  axioms  from  which  we  can  deduce 
everything  through  a  "proof  procedure".  Instead  we  have  a  large 
set  of  heuristics  and  procedures  for  solving  problems  at 
different  levels  of  generality.  Of  course,  there  Is  no  reason 
why  a  computer  should  do  things  the  way  a  person  does,  but  In 
Ignoring  this  type  of  knowledge,  programs  run  Into  tremendous 
problems  of  efficiency.  As  soon  as  a  "uniform  procedure" 
theorem  prover  gets  a  large  set  of  axioms  (even  well  below  the 
number  needed  for  really  understanding  language).  It  become!? 
bogged  down  In  searching  for  a  proof,  since  there  Is  no  easy  way 
to  guide  Its  search  according  to  the  subject  matter.  In 
addition,  a  proof  which  takes  many  steps  (even  If  they  are  In  a 
sequence  which  can  be  easily  predicted  by  the  nature  of  the 
theorem)  may  take  Impossibly  long  since  It  Is  very  difficult  to 
describe  the  correct  proving  procedure  to  the  system. 

It  Is  possible  to  write  theorems  In  a  clever  way  In  order 
to  Implicitly  guide  the  deduction  process,  and  a  recent  paper 
<Green  1969>  describes  some  of  the  problems  In  "techniques  for 
"programming"  In  first-order  logic".  First  order  logic  Is  a 
declarative  rather  than  Imperative  language,  and  to  get  an 
Imperative  effect  (I.e.  telling  It  how  to  go  about  doing 
something)  takes  a  good  deal  of  careful  thought  and  clever 
trickery. 

It  might  be  possible  to  add  strategy  Information  to  a 
predicate  calculus  theorem  prover,  but  with  current  systems  such 
as  QA3,  "To  change  strategies  In  the  current  version,  the  user 
must  know  about  set-of-support  and  other  program  parameters  such 
as  level  bound  and  term-depth  bound.  To  radically  change  the 
strategy,  the  user  presently  has  to  know  the  LISP  language  and 
must  be  able  to  modify  certain  strategy  sections  of  the 
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program."  (<Green  1969>  p.236).  In  newer  programs  such  as  QA4/ 
there  will  be  a  special  strategy  language  to  go  along  with  the 
theorem-proving  mechanisms,  !t  will  be  interesting  to  see  how 
close  these  new  strategy  languages  are  to  PLANNER/  and  whether 
there  is  any  advantage  to  be  gained  by  putting  them  in  a  hybrid 
with  a  resolution-based  system.  As  to  the  completeness 
argument/  there  are  good  reasons  not  to  have  a  complete  system  - 
-  these  are  discussed  later  in  this  section. 


I 
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PrpgedMr.al 

The  problem  with  the  limited  logic  systems  wasn't  the  fact 
that  they  expressed  their  ccmplex  Information  In  the  form  of 
programs  or  procedures.  The  problem  was  that  these  programs 
were  organized  In  such  a  way  that  "...each  change  In  a 
subprogram  may  affect  more  of  the  other  subprograms.  The 
structure  grows  more  awkward  and  difficult  to 
general  I ze. . .FI nal 1 y  the  system  may  become  too  unwieldy  for 
further  experimentation."  (<Raphae1  1964>  p.91).  Nevertheless# 
It  was  necessary  to  build  In  more  and  more  of  these  subprograms 
In  order  to  accept  new  subject  matter. 

What  was  needed  was  the  development  of  new  programming 
techniques  so  that  systems  could  retain  the  capability  of  using 
procedural  Information#  but  at  the  same  time  express  this 
Information  In  a  simple  and  straightforward  way  which  did  not 
depend  on  the  peculiarities  and  special  strucuture  of  a 
particular  program  or  subject  of  discussion. 

A  system  which  partially  fits  this  description  Is  Woods' 
<V>loods  1968>.  It  uses  a  quant! ficatlonal  query  language  for 
expressing  questions#  then  assumes  that  there  are  "semantic 
primitives"  In  the  form  of  LISP  subroutines  which  decide  such 
predicates  as  (CONNEC*''  PLIGHT-23  BOSTON  CHICAGO)  and  which 
evaluate  functions  such  as  "number  of  stops"#  "owner"#  etc.  The 
thing  which  makes  this  system  different  from  the  limited  logic 
systems  Is  that  the  entire  system  was  designed  without  reference 
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to  the  way  the  particular  "primitive”  functions  would  operate  on 
the  data  base.  In  a  way/  this  Is  avoiding  the  Issue/  since  the 
t.iformatlon  which  the  system  was  deslt^ned  to  handle  (the 
Official  Airline  Guide)  Is  particularly  amenable  to  simple 
table~lookup  routines.  If  we  had  to  handle  less  structured 
Information  of  the  type  usually  done  with  theorem  proverS/  these 
primitive  routines  might  Indeed  run  Into  the  same  problems  of 
Interconnectedness  described  In  the  quote  above/  and  would 
become  harder  and  harder  to  generalize. 

PLANNER  was  designed  by  Carl  Hewitt  as  a  goal-oriented 
procedural  language  to  dual  with  these  problems.  It  has  special 
mechanisms  for  dealing  with  assertions  In  an  efficient  way/  and 
In  addition  has  the  capabllty  to  Include  any  complex  Information 
which  can  be  expressed  In  the  predicate  calculus.  More 
Important/  the  complex  Information  Is  expressed  In  the  form  of 
procedures/  which  can  Include  all  sorts  of  knowledge  of  how  to 
best  go  about  proving  things.  The  language  Is  "goal-oriented"/ 
In  that  we  do  not  have  to  be  concerned  about  the  details  of 
Interaction  between  the  different  procedures.  If  at  different 
places  In  our  knowledge  we  have  theorems  which  ask  whether  an 
object  Is  sturdy  (for  example  In  a  theorem  about  support/  about 
building  houses/  etc.)  they  are  not  forced  to  specify  the 
program  which  will  serve  as  sturdiness-inspector.  Instead  they 
say  something  like  "Try  to  find  an  assertion  that  X  Is  sturdy/ 
or  prove  It  using  anything  you  can." 


If  we  know  of  special 
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procedures  which  seem  most  likely  to  give  a  quick  answer#  we  can 
specify  that  these  should  be  tried  first.  But  If  at  some  point 
we  add  a  new  sturdiness-tester#  we  do  not  need  to  find  out  which 
theorems  use  It.  We  need  only  add  It  to  the  data  base#  and  the 
system  will  automatically  try  It  (along  with  any  other 
sturdiness-testers)  whenever  any  theorem  gives  the  go-ahead. 

The  ability  to  add  new  theorems  without  relating  them  to 
other  theorems  Is  the  advantage  of  a  "uniform"  notation.  In 
fact  PLANNER  Is  a  uniform  notation  for  expressing  procedural 
knowledge  just  as  predicate  calculus  !s  a  notation  for  a  more 
limited  range  of  Information.  The  advantage  Is  that  PLANNER  has 
a  hierarchical  control  structure.  In  addition  to  specifying 
logical  relationships#  a  theorem  can  take  over  control  of  the 
deduction  process. 

We  can  have  complete  control  over  how  the  system  will 
operate.  In  any  theorem#  we  can  tell  It  to  try  to  prove  a 
subgoal  using  only  certain  theorems  (If  we  know  that  the  goal  Is 
bound  to  fall  unless  one  of  them  works)#  we  can  tell  It  to  try 
things  In  a  ce.-tain  order  (and  the  choice  of  this  order  can 
depend  on  arbitrarily  complex  calculations  which  take  place  when 
the  subgoal  Is  set  up)  or  we  can  even  write  a  "spoiler"  theorem# 
which  can  tell  the  system  that  a  goal  Is  certain  to  fall#  and 
that  no  other  theorems  should  even  be  tried. 

Notice  that  this  control  structure  makes  It  very  difficult 
to  characterize  the  abstract  logical  properties  of  PLANNER#  such 
as  consistency  and  completeness,  it  Is  worth  pointing  out  here 
that  completeness  may  In  fact  be  a  bad  property.  It  means  (we 
believe#  necessarily)  that  If  the  theorem-prover  Is  given 
something  to  prove  which  Is  In  fact  false#  It  will  exhaust  every 
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possible  way  of  trying  to  prove  It.  By  forsaking  completeness/ 
we  allow  ourselves  to  use  good  sense  In  deciding  when  to  give 
up. 

In  a  truly  uniform  system/  the  theorem  prover  Is  forced  to 
"rediscover  the  world"  every  time  It  answers  a  question.  Every 
goal  forces  It  to  start  from  scratch,  looking  at  all  of  the 
theorems  In  the  data  base  (perhaps  using  some  subject-matter- 
free  heuristics  to  make  a  rough  selection).  Because  It  does  not 
want  to  be  limited  to  domain-dependent  Information,  It  cannot 
use  It  at  all.  PLANNER  can  operate  In  this  "bllndman"  mode  If 
we  ask  It  to  (and  It  Is  less  efficient  at  doing  so  than  a 
procedure  specially  Invented  to  operate  this  way),  but  It  should 
have  to  do  this  only  rarely  —  when  discovering  something  which 
was  not  known  or  understood  when  the  basic  theorems  were 
written.  The  rest  of  the  time  It  can  go  about  proving  things 
which  It  knows  how  to  do.,  without  a  tremendous  overhead  of 
having  to  piece  together  a  proof  from  scratch  each  time.  As 
mentioned  above.  It  might  be  possible  to  patch  "strategy 
programs"  onto  theorems  In  conventional  theorem-provers  In  order 
to  accomplish  the  same  goal.  In  PLANNER  we  have  the  advantage 
that  this  can  be  done  naturally  using  the  notation,  and  the 
strategy  Is  embedded  In  the  PLANNER  theorems,  which  themselves 
can  be  looked  at  as  data.  In  an  advanced  system  a  PLANNER 
program  could  be  written  to  learn  from  experience.  Once  the 
"blindnr.an  mode"  finds  a  proof,  the  method  It  used  could  be 
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remembered  and  tried  first  when  a  similar  goal  Is  generated 
again.  See  section  5.1  for  more  discussion  cf  learning. 

To  those  accustomed  to  uniform  proof  procedures,  this  all 
sounds  like  cheating.  Is  the  system  really  proving  anything  If 
you  are  giving  It  clues  about  what  to  do?  Why  Is  It  different 
from  a  simple  set  of  programmed  LISP  procedures  like  those 
envisioned  by  Woods?  First,  the  language  Is  designed  so  that 
theorems  can  be  written  Independently  of  each  other,  wItho'Jt 
worrying  about  when  they  will  be  called,  or  what  other  theorems 
and  data  they  will  need  to  prove  their  subgoals. 

The  language  Is  designed  so  that  If  we  want,  we  can  write 
theorems  In  a  form  which  Is  almost  Identical  to  the  predicate 
calculus,  so  we  have  the  benefits  of  a  uniform  system.  On  the 
other  hand,  we  have  the  capability  to  add  as  much  subject- 
dependent  knowledge  as  we  want,  telling  theorems  about  other 
theorems  and  proof  procedures.  The  system  has  an  automatic 
goal-tree  backup  system,  so  that  even  when  we  are  specifying  a 
particular  order  In  which  to  do  things,  we  may  not  know  how  the 
system  will  go  about  doing  them.  It  will  be  able  to  follow  our 
suggestions  and  try  many  different  theorems  to  establish  a  goal, 
backing  up  and  trying  another  automatically  If  one  of  them  leads 
to  a  failure  (see  section  3.3). 

In  summary,  the  main  advance  In  a  deductive  system  using 
PLANNER  Is  In  allowing  ourselves  to  have  a  data  base  of 
procedures  rather  than  formulas  to  express  complex  Information. 
This  combines  the  generality  and  power  of  a  theorem  prover  with 
the  ability  to  accept  procedural  knowledge  and  heuristics 
relevant  to  the  data,  it  provides  a  flexible  and  powerful  tool 
to  serve  as  the  basis  for  a  language  understanding  system.  The 
rest  of  this  chapter  describes  the  PLANNER  language  and  the  way 
It  Is  used  In  our  system. 
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3.3  Programming  In  PLANNER 

l.Tul  Basic  Qpgr^tlQP  Si  PLAMES 

The  easiest  way  to  understand  PLANNER  Is  to  watch  how  It 
works,  so  In  this  section  we  will  present  a  few  simple  examples 
and  explain  the  use  of  some  of  Its  most  elementary  features. 

First  we  will  take  the  most  venerable  of  traditional 
deductions: 

Turing  Is  a  human 

All  humans  are  fallible 
so 

Turing  Is  fallible. 

It  Is  easy  enough  to  see  how  this  could  be  expressed  In  the 
usual  logical  notation  and  handled  by  a  uniform  proof  procedure. 
Instead,  let  us  express  It  In  one  possible  way  to  PLANNER  by 
saying: 

(THASSERT  (HUMAN  TURING)) 

;Thls  asserts  that  Turing  Is  human. 

(DEFPROP  THEOREM! 

(THCONSE  (X)  (FALLIBLE  $?X) 

(THGOAL  (HUMAN  $?X))) 

THEOREM) 

;ThIs  Is  one  way  of  saying  that  all  humans 
;are  fal 1 Ible. 

The  proof  would  be  generated  bv  asking  PLANNER  to  evaluate 
the  expression: 

(THGOAL  (FALLIBLE  TURING)  (THTBF  THTRUE)) 

We  Immediately  see  several  points.  First,  there  are  two 
different  ways  of  storing  Information.  Simple  assertions  are 
stored  In  a  data  base  of  assertions,  while  more  complex 
sentences  containing  quantifiers  or  logical  connectives  are 
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expressed  In  the  form  of  theorems. 

Second/  one  of  the  most  Important  points  about  PLANNER  Is 
that  It  Is  an  evaluator  for  statements  written  In  a  programming 
language.  It  accepts  input  In  the  form  of  expressions  written 
In  the  PLANNER  language,  and  evaluates  them,  producing  a  value 
and  side  effects.  THASSERT  Is  a  function  which,  when  evaluated, 
stores  Its  argument  In  the  data  base  of  assertions  or  the  data 
base  of  theorems  (which  are  cross-referenced  In  various  ways  to 
give  the  system  efficient  look-up  capabilities).  A  theorem  Is 
defined  with  DEFPP.OP  as  are  functions  In  LISP. 

In  this  example  we  have  defined  a  theorem  of  the  THCONSE 
type  (THCONSE  means  consequent;  we  will  see  other  types  later). 
This  states  that  If  we  ever  want  to  establish  a  goal  of  the  form 
(FALLIBLE  $?X),  we  can  do  this  by  accomplishing  the  goal  (HUMAN 
$?X),  where  X  Is  a  variable.  The  strange  prefix  characters  are 
part  of  planner's  pattern  matching  capabilities.  If  we  ask 
PLANNER  to  prove  a  goal  of  the  form  (A  X),  there  is  no  obvious 
way  of  knowing  whether  A  and  X  are  constants  (like  TURING  and 
HUMAN  In  the  example)  or  variables.  LISP  solves  this  problem  by 
using  the  function  QUOTE  to  Indicate  constants.  In  pattern 
matching  this  Is  Inconvenient  and  makes  most  patterns  much 
bulkier  and  more  difficult  to  read.  Instead,  PLANNER  uses  the 
opposite  convention  --  a  constant  Is  represented  by  the  atom 
Itself,  while  a  variable  must  be  Indicated  by  adding  an 
appropriate  prefix.  This  prefix  differs  according  to  the  exact 
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use  of  the  variable  In  the  pattern,  but  for  the  time  being  let 
us  Just  accept  $?  as  a  prefix  Indicating  a  variable.  The 
definition  of  the  theorem  Indicates  that  It  has  one  variable,  X, 
by  the  (X)  following  THCONSE. 

The  third  statement  Illustrates  the  function  THGOAL,  v/hich 
calls  the  PLANNER  Interpreter  to  try  to  prove  an  assertion. 

This  can  function  In  several  ways.  If  we  had  asked  PLANNER  to 
evaluate  (THGOAL  (HUMAN  TURING))  It  would  have  found  the 
requested  assertion  Immediately  In  the  data  base  and  succeeded 
(returning  as  Its  value  some  Indicator  that  It  had  succeeded). 
However,  (FALLIBLE  TURING)  has  not  been  asserted,  so  we  must 
resort  to  theorems  to  prove  It. 

Later  we  will  see  that  a  THGOAL  statement  can  give  PLANNER 
various  kinds  of  advice  on  which  theorems  are  applicable  to  the 
goal  and  should  be  tried.  For  the  moment,  (THTBF  THTRUE)  Is 
advice  that  causes  the  evaluator  to  try  all  theorems  whose 
consequent  I s  of  a  form  which  matches  the  goal.  (I.e.  a  theorem 
with  a  consequent  ($?Z  TURING)  would  be  tried,  but  one  of  the 
form  (HAPPY  $?Z)  or  (FALLIBLE  $?Y  $?Z)  would  not.  Assertions 
can  have  an  arbitrary  list  structure  for  their  format  —  they 
are  not  limited  to  two-member  lists  or  three-member  lists  as  In 
these  examples.)  The  theorem  we  have  just  defined  would  be 
found,  and  In  trying  It,  the  match  of  the  consequent  to  the 
goal  would  cause  the  variable  $?X  to  be  assigned  to  the  constant 
TURING.  Therefore,  the  theorem  sets  up  a  new  goal  (HUMAN 
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TURING)  and  this  succeeds  Immediately  since  It  Is  In  the  data 
base.  In  general,  the  success  of  a  theorem  will  depend  on 
evaluating  a  PLANNER  program  of  arbitrary  complexity.  In  this 
case  It  contains  only  a  single  THGOAL  statement,  so  Its  success 
causes  the  entire  theorem  to  succeed,  and  the  goal  (FALLIBLE 
TURING)  Is  proved. 

Consider  the  question  "Is  anything  fallible?",  or  In 
logic,  (EXISTS  (Y)(FALLIBLE  Y)).  This  requires  a  variable  and 
It  could  be  expressed  In  PLANNER  as: 

(THPROG  (Y)  (THGOAL  (FALLIBLE  $?Y)(THTBF  THTRUE))) 

Notice  that  THPROG  (PLANNER'S  equivalent  of  a  LISP  PROG, 
complete  with  GO  statements,  tags,  RETURN,  etc.)  acts  as  an 
existential  quantifier.  It  provides  a  binding-place  for  the 
variable  Y,  but  does  not  lnltl_llze  It  --  It  leaves  It  In  a 
state  particularly  marked  as  unassIgned.  To  answer  the 
question,  we  ask  PLANNER  to  evaluate  the  entire  THPROG 
expression  above.  To  do  this  It  starts  by  evaluating  the  THGOAL 
expression.  This  searches  the  data  base  for  an  assertion  of  the 
form  (FALLIBLE  $?Y)  and  falls.  It  then  looks  for  a  theorem  with 
a  consequent  of  that  form,  since  the  recommendation  (THTBF 
THTRUE)  says  to  look  at  all  possible  theorems  which  might  be 
applicable.  When  the  theorem  defined  above  Is  called,  the 
variable  X  In  the  theorem  Is  Identified  with  the  variable  Y  In 
the  goal,  but  since  Y  has  no  value  yet,  X  does  not  receive  a 
value.  The  theorem  then  sets  up  the  goal  (HUMAN  $?X)  with  X  as 
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a  variable.  The  data-base  searching  mechanism  takes  this  as  a 
command  to  look  for  any  assertion  which  matches  that  pattern 
(I.e.  an  Instantiation),  and  finds  the  assertion  (HUMAN  TURING). 
This  causes  X  (and  therefore  Y)  to  be  assigned  to  the  constant 
TURING,  and  the  theorem  succeeds,  completing  the  proof  and 
returning  the  value  (FALLIBLE  TURING). 
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There  seems  to  be  something  missing.  So  far#  the  data  base 
has  contained  only  the  relevant  objects#  and  therefore  PLANNER 
has  found  the  right  assertions  Immediately.  Consider  the 
problem  we  would  get  if  we  added  new  Information  by  evaluating 
the  statements: 

(THASSERT  (HUMAN  SOCRATES)) 

(THASSERT  (GREEK  SOCRATES)) 

Our  data  base  now  contains  the  assertions: 

(HUMAN  TURING) 

(HUMAN  SOCRATES) 

(GREEK  SOCRATES) 

and  the  theorem: 

(THCONSE  (X)  (FALLIBLE  $?X) 

(THGOAL  (HUMAN  $?X))) 

What  If  we  now  ask#  "Is  there  a  fallible  Greek?"  In  PLANNER 
we  would  do  this  by  evaluating  the  expression: 

(THPROG  (X)  (THGOAL  (FALLIBLE  $?X)(THTBF  THTRUE)) 

(THGOAL  (GREEK  $?X))) 

THPROG  acts  like  an  AND#  Insisting  that  all  of  Its  terms 
are  satisfied  before  the  THPROG  Is  happy.  Notice  what  might 
happen.  The  first  THGOAL  may  be  satisfied  by  the  exact  same 
deduction  as  before#  since  we  have  not  removed  Information.  If 
the  data-base  searcher  happens  to  run  Into  TURING  before  It 
finds  SOCRATES#  the  goal  (HUMAN  $?X)  will  succeed#  assigning  $?X 
to  TURING.  After  (FALLIBLE  $?X)  succecs#  the  THPROG  will  then 
establish  the  new  goal  (GREEK  TURING)#  which  Is  doomed  to  fall 
since  It  has  not  been  asserted#  and  there  are  no  applicable 


Section  3.3.2  -  Page  2US 


theorems.  If  we  think  In  LISP  terms/  this  Is  a  serious  problem, 
since  the  evaluation  of  the  first  THGOAL  has  been  completed 
before  the  second  one  Is  called,  and  the  "push-down  list"  now 
contains  only  the  THPROG.  If  we  try  to  go  back  to  the  beginning 
and  start  over.  It  will  again  find  TURING  and  so  on,  ad 
I nf I n I  turn. 

One  of  the  most  Important  features  of  the  PLANNER  language 
Is  that  backup  In  case  of  failure  Is  always  possible,  and 
moreover  this  backup  can  go  to  the  last  place  where  a  decision 
of  any  sort  was  made.  Here,  the  decision  was  to  pick  a 
particular  assertion  from  the  data  base  to  match  a  goal.  Other 
decisions  might  be  the  choice  of  a  theorem  to  satisfy  a  goal,  or 
a  decision  of  other  ty'>es  found  in  more  complex  PLANNER 
functions  such  as  THOR  (the  equivalent  of  LISP  OR).  PLANNER 
keeps  enough  information  to  change  any  decision  and  send 
evaluation  back  down  a  new  path. 

In  our  example  the  decision  was  made  Inside  the  theorem  for 
FALLIBLE,  when  the  goal  (HUMAN  $?X}  was  matched  to  the  assertion 
(HUMAN  TURING).  PLANNER  will  retrace  Its  steps,  try  to  find  a 
different  assertion  which  matches  the  goal,  find  (HUMAN 
SOCRATES),  and  continue  with  the  proof.  The  theorem  will 
succeed  with  the  value  (FALLIBLE  SOCRATES),  and  the  THPROG  will 
proceed  to  the  next  expression,  (THGOAL  (GREEK  $?X)).  Since  X 
has  been  assigned  to  SOCRATES,  this  will  set  up  the  goal  (GREEK 
SOCRATES)  which  will  succeed  Immediately  by  finding  the 
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corresponding  assertion  In  the  data  base.  Since  there  are  no 
more  expressions  In  the  THPROG,  It  will  succeed,  returning  as 
Its  value  the  value  of  the  last  expression,  (GREEK  SOCRATES). 

The  whole  course  of  the  deduction  process  depends  on  the  failure 
mechanism  for  backing  up  and  trying  things  over  (this  Is 
actually  the  process  of  trying  different  branches  down  the 
subgoal  tree.)  All  of  the  functions  like  THCOND,  THAND,  THOR, 
etc,  are  controlled  by  success  vs.  failure.  Thus  It  Is  the 
PLANNER  executive  which  establishes  and  manipulates  subgoals  In 
looking  for  a  proof. 
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?!?■?,  Differences  w? th  Other  Theorem-Provers  and  Langua;;es 

Although  PLANNER  Is  written  as  a  programming  language.  It 
differs  In  several  critical  ways  from  anywhlng  which  Is  normally 
comldered  a  programming  language.  First,  It  Is  goal -dl  rected. 
Theorems  can  be  thought  of  as  subroutines,  but  they  can  be 
called  through  a  very  general  pattern-matcher  which  looks  at  the 
goal  which  Is  to  be  satisfied.  This  Is  like  having  the  abllltly 
to  say  "Call  a  subroutine  which  will  achieve  the  desired  result 
at  this  point."  Second,  the  evaluator  has  the  mechanism  of 
success  and  failure  to  handle  the  exploration  of  the  subgoal 
tree.  Other  languages,  such  as  LISP,  with  a  basic  recursive 
evaluator  have  no  way  to  do  this.  Third,  PLANNER  contains  a 
bookkeeping  system  for  matching  patterns  and  manipulating  a  data 
base,  and  for  handling  that  data  base  efficiently. 

How  Is  PLANNER  different  from  a  theorem  prover?  What  Is 
gained  by  writing  theorems  In  the  form  of  programs,  and  giving 
them  power  to  call  other  programs  which  manipulate  data?  The 
key  Is  In  the  form  of  the  data  the  theorem-prover  can  accept. 
Most  systems  take  decl arat I ve  Information,  as  In  predicate 
calculus.  This  Is  In  the  form  of  expressions  which  represent 
"facts"  about  the  world.  These  are  manipulated  by  the  theorem- 
prover  according  to  some  fixed  uniform  process  set  by  the 
system.  PLANNER  can  make  use  of  Imperative  Information, 
telling  It  how  to  go  about  proving  a  subgoal,  or  to  make  use  of 
an  assertion.  This  produces  what  Is  called  hierarchical  control 
structure.  That  Is,  any  theorem  can  Indicate  what  the  theorem 
prover  Is  supposed  to  do  as  It  continues  tha  proof.  It  has  the 
full  power  of  a  general  programming  language  to  evaluate 
functions  which  can  depend  on  both  the  data  base  and  the  subgoal 
tree,  and  to  use  Its  results  to  control  the  further  proof  by 
making  assertions,  deciding  what  theorems  are  to  be  used,  and 
specifying  a  sequence  of  steps  to  be  followed. 

What  does  this  mean  In  practical  terms?  In  what  way  does 
It  make  a  "better"  theorem  prover?  We  will  give  several 
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examples  of  areas  where  the  appro;'ch  Is  Important. 

First,  consider  the  basic  problem  of  deciding  what  subgoals 
to  try  In  attempting  'to  satisfy  a  goal.  Very  often,  knowledge 
of  the  subject  matter  will  tell  us  that  certain  methods  are  very 
likely  to  succeed,  others  may  be  useful  If  certain  other 
conditions  are  present,  while  others  may  be  possibly  valuable, 
but  not  likely.  We  would  like  to  have  the  ability  to  use 
heuristic  programs  to  determine  these  facts  and  direct  the 
theorem  prover  accordingly.  It  should  be  able  to  direct  the 
search  for  goals  and  solutions  In  the  best  way  possible,  and 
able  to  bring  as  much  Intelligence  as  possible  to  bear  on  the 
decision.  In  PLANNER  this  Is  done  by  adding  to  our  THGOAL 
statement  a  recommendation  list  which  can  specify  that  ONLY 
certain  theorems  are  to  be  tried,  or  that  certain  ones  are  to  be 
tried  FIRST  In  a  specified  order.  Since  theorems  are  programs, 
subroutines  of  any  type  can  be  called  to  help  make  this  decision 
before  establishing  a  new  THGOAL.  Each  theorem  has  a  name  (In 
our  definition  at  the  beginning  of  Section  3.1.1,  the  theorem 
was  given  the  name  THEOREMl),  to  facilitate  referring  to  It 
explicitly. 

The  simplest  kind  of  reconmiendatlon  Is  THOSE,  which  takes  a 
list  of  theorems  (by  names)  and  recommends  that  they  be  tried  In 
the  order  listed.  A  more  general  recommendation  uses  filters 
which  look  at  the  theorem  and  decide  whether  It  should  be  tried. 
The  user  defines  his  own  filters,  except  for  the  standard  filter 
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THTRUE/  which  accepts  any  theorem. 

The  filter  command  for  theorems  Is  THTBF,  so  a 
recommendation  list  of  the  form: 

((THUSE  THl  TH2)  (THTBF  TESTXTHUSE  TH-DESPERATl  ON) ) 
would  mean  to  first  try  the  theorem  named  THl,  then  TH2,  then 
any  theorem  which  passes  the  filter  named  TEST  (which  the  user 
would  define),  then  If  all  that  falls,  use  the  theorem  named  TH- 
DESPERATION.  In  our  programs,  we  have  made  use  of  only  the 
simple  capabilities  for  choosing  theorems  --  we  do  not  define 
filters  other  than  THTRUE.  However,  there  !s  also  a  capability 
for  filtering  assertions  In  a  similar  way,  and  we  do  use  this, 
as  explained  In  section  4.3. 


r 


I 
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3.3.4  Control  1  Ing 

An  Important  problem  Is  that  of  maintaining  a  data  base 
with  a  reasonable  amount  of  material.  Consider  the  first 
example  above.  The  statement  that  all  humans  are  fallible/ 
while  unambiguous  in  a  declarative  sense  Is  actually  ambiguous 
In  its  imperative  sense  (i.e.  the  way  it  Is  to  be  used  by  the 
theorem  prover).  The  first  way  Is  to  simply  use  It  whenever  we 
are  faced  with  the  need  to  prove  (FALLIBLE  $?X).  Another  way 
might  be  to  watch  for  a  statement  of  the  form  (HUMAN  $?X)  to  be 
asserted/  and  to  Immediately  assert  (FALLIBLE  $?X)  as  well. 

There  is  no  abstract  logical  difference/  but  the  Impact  on  the 
data  base  is  tremendous.  The  more  conclusions  we  draw  when 
Information  Is  asserted/  the  easier  proofs  will  be/  since  they 
will  not  have  to  make  the  additional  steps  to  deduce  these 
consequences  over  and  over  again.  However  since  we  don't  have 
Infinite  speed  and  size/  It  Is  clearly  folly  to  think  of 
deducing  and  asserting  everything  possible  (or  even  everything 
interesting)  about  the  data  when  It  Is  entered.  If  we  were 
working  with  totally  abstract  meaningless  theorems  and  axioms 
(an  assumption  which  would  not  be  Incompatible  with  many 
theorem-proving  scht^mes}/  this  would  be  an  Insoluble  dilemma. 

But  PLANNER  Is  designed  to  work  In  the  real  world/  where  our 
knowledge  Is  much  more  structured  than  a  set  of  axioms  and  rules 
of  Inference.  We  may  very  well/  when  we  assert  (LIKES  $?X 
POETRY)  want  to  deduce  and  assert  (HUMAN  $?X)/  since  In 
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deducing  things  about  an  object/  It  will  very  often  be  relevant 
whether  that  object  Is  human,  and  we  shouldn't  need  to  deduce  It 
each  time.  On  the  other  hand.  It  would  be  sMIy  to  assert  (HAS- 
AS-PART  $?X  SPLEEN),  since  there  Is  a  horde  of  facts  equally 
Important  and  equally  limited  In  use.  Part  of  the  knowledge 
which  PLANNER  should  have  of  a  subject,  then.  Is  what  facts  are 
Important,  and  when  to  draw  consequences  of  an  assertion.  This 
Is  done  by  having  theorems  of  an  antecedent  type: 

(DEFPROP  THE0REM2 

(THANTE  (X  Y)  (LIKES  $?X  $?Y) 

(THASSERT  (HUMAN  $?X))) 

THEOREM) 

This  says  that  when  we  assert  that  X  likes  something,  we 
should  also  assert  (HUMAN  $(X).  Of  course,  such  theorems  do  not 
have  to  be  so  simple.  A  fully  general  PLANNER  program  can  be 
activated  by  an  THANTE  theorem,  doing  an  arbitrary  (that  Is,  the 
programmer  has  free  choice)  amount  of  deduction,  asssertlon, 
etc.  Knowledge  of  what  we  are  doing  In  a  particular  problem  may 
Indicate  that  It  Is  sometimes  a  good  Idea  to  do  this  kind  of 
deduction,  and  othe.*  times  not.  As  with  the  CONSEQUENT 
theorems,  PLANNER  has  the  full  capacity  when  something  Is 
asserted,  *‘o  evaluate  the  current  state  of  the  data  and  proof, 
and  specifically  decide  which  ANTECEDENT  theorems  should  be 
called. 

PLANNER  therefore  allows  deductions  to  use  all  sorts  of 
knowledge  about  the  subject  matter  which  go  far  beyond  the  set 
of  axioms  and  basic  deductive  rules,  PLANNER  Itself  Is  subject- 
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Independent,  but  Its  power  Is  that  the  deduction  processs  never 
needs  to  operate  on  such  a  level  of  Ignorance.  The  programmer 
can  put  In  as  much  heuristic  knowledge  as  he  wants  to  about  the 
subject,  just  as  a  good  teacher  would  help  a  class  to  understand 
a  mathematical  theory,  rather  than  jus:  telling  them  the  axioms 
and  then  giving  theorems  to  prove. 
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JLliui  ^Y^ntS  SiM  States 

Another  advantage  In  representing  knowledge  In  an 
Imperative  form  Is  the  use  of  a  theorem  prover  In  aeallng  with 
processes  Involving  a  sequence  of  events.  Consider  the  case  of 
a  robot  manipulating  blocks  on  a  table.  It  might  have  data  of 
the  form,  "blockl  Is  on  block2,"  "block2  Is  behind  block3"/  and 
"If  X  Is  on  y  and  you  put  It  on  z,  then  x  Is  on  z,  and  Is  no 
longer  on  y  unless  y  Is  the  same  as  z”.  Many  examples  In  papers 
on  theorem  provers  are  of  this  form  (for  example  the  classic 
"monkey  and  bananas"  problem).  The  problem  Is  that  a 
declarative  theorem  prover  cannot  accept  a  statement  like  (ON  B1 

B2)  at  face  value.  It  clearly  Is  not  an  axiom  of  the  system, 

since  Its  validity  will  change  as  the  process  goes  on.  It  must 
be  put  In  a  form  (ON  B1  B2  SO)  where  SO  Is  a  symbol  for  an 
Initial  state  of  the  world.  See  <Green  1969>  for  a  discussion 
of  such  "state"  problems. 

The  third  statement  might  be  expressed  as: 

(FORALl  (X  Y  Z  S)(AUD  (ON  X  Y  (PUT  X  Y  S)) 

(0R(EQUAL  Y  Z) 

(N0T(0N  X  Z  (PUT  X  Y  S)))))) 

In  this  representation,  PUT  Is  c  function  whose  value  Is 
the  state  which  results  from  putting  X  on  Y  when  the  previous 
state  was  S.  We  run  Into  a  problem  when  v/e  try  to  ask  (CN  Z  W 

(PUT  X  Y  S))  I.e.  Is  block  Z  on  block  W  after  we  put  X  on  Y?  A 

human  knows  that  If  we  haven't  touched  Z  or  W  we  could  just  ask 
(ON  Z  W  >)  but  In  general  It  may  take  a  complex  deduction  to 


f 
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decide  whether  we  have  actually  moved  them,  and  even  If  we 
haven't.  It  will  take  a  whole  chain  of  deductions  (tracing  back 
through  the  time  sequence)  to  prove  they  haven't  been  moved.  In 
PLANNER,  where  we  specify  a  process  directly,  this  whole  type  of 
problem  can  be  handled  In  an  Intuitively  more  satisfactory  way 
by  using  the  primitive  function  THERASE. 

Evaluating  (THERASE  (ON  $?X  $?Y))  removes  the  assertion  (ON 
$?X  $?Y)  from  the  data  base.  If  we  think  of  theorem  provers  as 
working  with  a  set  of  axioms.  It  seems  strange  to  have  a 
function  whose  purpose  Is  to  erase  axioms.  If  Instead  we  think 
of  the  data  base  as  the  "state  of  the  world''  and  the  operation 
of  the  prover  as  manipulating  that  state.  It  allows  us  to  make 
great  simplifications.  Now  we  can  simply  assert  (ON  B1  B2} 
without  any  explicit  mention  of  states.  We  can  express  the 
necessary  theorem  as: 

(DEFPROP  THE0REM3 

(THCONSE  (X  Y  Z)  (PUT  $?X  $?Y) 

(THGOAL  (ON  $?X  $?Z)) 

(THERASE  (ON  $7X  $?Z)) 

(THASSERT  (ON  $?X  $?Y))) 

THEOREM) 

This  says  that  whenever  we  want  to  satisfy  a  goal  of  the 
form  (PUT  $?X  $?Y),  we  should  first  find  out  what  thing  Z  the 
thing  X  Is  sitting  on,  erase  the  fact  that  It  Is  sitting  on  Z, 
and  assert  that  It  Is  sitting  on  Y.  We  could  also  do  a  number  of 
other  things,  such  as  proving  that  It  Is  Indeed  possible  to  put 
X  on  Y,  or  adding  a  list  of  specific  Instructions  to  a  movement 
plan  for  an  arm  to  actually  execute  the  goal. In  a  more  complex 
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case/  other  Interactions  might  be  Involved.  For  example#  If  we 
are  keeping  assertions  of  the  form  (ABOVE  $?X  $?Y)  we  would  need 
i  to  delete  those  assertions  which  became  false  when  we  erased  (ON 

I  $?X  $?Z)  and  add  those  which  became  true  when  we  added  (ON  $?X 

$?Y).  ANTECEDENT  theorems  would  be  called  by  the  assertion  (ON 
$?X  $?Y)  to  take  care  of  that  part#  and  a  similar  group  called 
ERASING  theorems  can  be  called  In  an  exactly  analogous  way  when 
an  assertion  Is  erased#  to  derive  consequences  of  the  erasure. 
Again  we  emphasize  that  which  of  such  theorems  would  be  called 
is  dependent  on  the  way  the  data  base  Is  structured#  and  Is 
determined  by  knowledge  of  the  subject  matter.  In  this  example# 
we  would  have  to  decide  whether  It  was  worth  adding  all  of  ',:he 
ABOVE  relations  to  the  data  base#  with  the  resultant  need  to 
check  them  whenever  something  Is  moved#  or  Instead  to  omit  them 
and  take  time  to  deduce  them  from  the  ON  relation  each  time  they 
are  needed. 

Thus  In  PLANNER#  the  changing  state  of  the  world  can  be 
mirrored  In  the  changing  state  of  the  data  base#  avoiding  any 

i 

[  need  to  make  explicit  mention  of  states#  with  the  requisite 

I 

overhead  of  deductions.  This  Is  possible  since  the  Information 

j  Is  given  In  an  Imperative  form#  specifying  theorems  as  a  series 

I 

of  specific  steps  to  be  executed. 

If  we  look  back  to  the  distinction  between  assertions  and 
■  theorems  mr.de  on  the  first  page#  It  would  seem  that  we  have 

established  that  the  base  of  assertions  Is  the  "current  state  of 
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the  world”,  while  the  base  of  theorems  Is  our  permanent 
knowledge  of  how  to  deduce  things  from  that  state.  This  Is  not 
exactly  true,  and  one  of  the  most  exciting  possibilities  In 
PLANNER  Is  the  capability  for  the  program  Itself  to  create  and 
modify  the  PLANNER  functions  which  make  up  the  theorem  base. 
Rather  than  simply  making  assertions,  a  particular  PLANNER 
function  might  be  written  to  put  .together  a  new  theorem  or  make 
changes  to  an  existing  theorem,  in  a  way  dependent  on  the  data 
and  current  knowledge.  It  seems  likely  that  meaningful 
•'learning”  Involves  this  type  of  behavior  rather  than  simply 
modifying  parameters  or  adding  more  individual  facts 
(assertions)  to  a  declarative  data  base. 
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P_LANNER  fuQaLLaaa 

There  are  a  number  of  other  PLANNER  commands/  designed  to 
salt  a  range  of  problem-sol vl ng  needs.  They  are  described  In 
detail  in  <Hewltt  1969/  1970>  and  We  will  describe  only  those 
which  are  of  particular  use  In  our  question  answering  program 
and  which  we  will  want  to  refer  to  later. 

We  have  already  mentioned  the  basic  functions  and  described 
how  they  operate.  THGOAL  looks  for  assertions  In  the  data  base/ 
and  calls  theorems  to  achieve  goals.  THAND  takes  a  list  of 
PLANNER  expressions  and  succeeds  only  If  they  all  succeed  In  the 
order  they  are  listed.  THOR  takes  a  similar  list  and  tries  the 
expressions  In  order/  but  succeeds  as  soon  ai.  one  of  them  does. 
Remember  that  In  case  of  a  failure  farther  along  In  the 
deduction/  THOR  can  take  back  Its  decision  and  continue  on  down 
the  list.  The  other  simple  Ll'SP  functions  PROG/  COND/  and  NOT 
have  their  PLANNER  analogs/  THPROG/  THCOND/  and  THNOT/  which 
operate  just  as  their  LISP  counterparts/  except  that  they  are 
controlled  by  the  distinction  between  "failure"  and  "success" 
Instead  of  the  distinction  between  NIL  and  non-NIL.  THPROG  acts 
like  THAND/  falling  If  any  one  of  Its  members  falls. 

One  of  the  most  useful  PLANNER  functions  Is  THFIND/  which 
Is  used  to  find  all  of  the  objects  or  assertions  satisfying  a 
given  PLANNER  condition.  For  example/  If  we  want  to  find  all  of 
the  red  blocks/  we  can  evaluatet 
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(THFIND  ALL  $?X  (X) 

(THG0AL(BL0CK  $?X)) 

(THGOALCCOLOR  $?X  RED))) 

The  function  THFIND  takes  four  pieces  of  Information. 

First,  there  Is  a  parameter,  telling  It  how  many  objects  to  look 
for.  When  we  use  ALL,  it  looks  for  as  many  as  it  can  find,  and 
succeeds  If  It  finds  any.  If  we  use  an  Integer,  It  succeeds  as 
soon  as  It  finds  that  many,  without  looking  for  more.  If  we 
want  to  be  more  complex,  we  can  tell  It  three  things:  a.  how 
many  It  needs  to  succeed;  b.  how  many  It  needs  to  quit  looking, 
and  c.  whether  to  succeed  or  fall  If  It  reaches  the  upper  limit 
set  In  b. 

Thus  If  we  want  to  find  exactly  3  objects,  we  can  use  a 
parameter  of  (3  4  NIL),  which  means  "Don't  succeed  unless  there 
are  three,  look  for  a  fourth,  but  If  you  find  It,  fall". 

The  second  bit  of  Information  tells  It  what  we  want  In  the 
list  It  returns.  For  our  purposes,  this  will  alwcys  be  the 
variable  name  of  the  object  we  are  Interested  In.  The  third 
Item  Is  a  list  of  variables  to  be  used,  and  the  fourth  Is  the 
body  of  the  THFIND  statement.  It  Is  this  body  that  must  be 
satisfied  for  each  object  found.  It  Is  Identical  to  the  body  of 
a  THPROG,  and  can  have  tags  and  THGO  statements  as  well  as  a 
series  of  expressions  to  be  evaluated. 

Another  function  used  extensively  by  the  semantic 
Interpreter  Is  THAMONG.  This  takes  two  arguments,  the  second  Is 
a  list,  and  the  first  Is  the  name  of  a  variable.  If  the 
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variable  Is  assigned^  THAMONG  acts  just  like  LISP  MEMQ^ 
succeeding  If  the  value  of  the  variable  Is  contained  In  the 
list.  However#  If  the  variable  Is  unassIgned#  THAMONG  assigns 
It  to  the  first  member  of  the  list#  then  succeeds.  If  this 
causes  a  failure  to  back  tp  to  the  THAMONG#  It  binds  the 
variable  instead  to  the  second  member  and  tries  again.  This 
continues  until  the  entire  expression  succeeds  with  some 
assignment  or  the  list  Is  exhausted#  In  which  case  THAMONG 
returns  failure.  Using  this#  along  with  the  normal  binding 
mechanism  In  THGOAL  statements#  failure  can  be  used  to  run  a 
loop  through  a  list  of  objects  which  are  specified  by  giving  a 
PLANNER  goal  or  arbitrary  expression  which  they  satsify. 


k 
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3.4  The  BLOCKS  World 

We  need  a  subject  to  discuss  with  dur  language* 
understanding  program  which  gives  a  variety  of  things  to  say  and 
In  which  we  can  carry  on  a  discourse,  containing  statements, 
questions,  and  commands.  We  have  chosen  to  pretend  we  are 
talking  to  a  very  simple  type  of  robot  (like  the  ones  being 
develooed  In  Al  projects  at  Stanford  and  MIT)  with  only  one  arm 
and  an  eye.  it  can  look  at  a  scene  containing  toy  objects  like 
blocks  and  balls,  and  can  manipulate  them  with  Its  hand. 

We  have  not  tried  to  use  an  actual  robot  or  to  simulate  It 
In  physical  detail.  Since  we  are  Interested  primarily  In 
complex  language  activity,  we  have  adopted  a  very  simplified 
model  of  the  world,  and  the  "robot"  exists  only  as  a  display  on 
the  CRT  scope  attached  to  the  computer. 
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QbJacx? 

First  we  must  decide  what  objects  we  will  have  In  the 
world.  In  3.1,  we  adopted  some  conventions  for  notation  In 
representing  objects  and  assertions.  Any  symbol  which  begins 
with  represents  a  specific  object,  while  anything  beginning 

with  •'#"  Is  the  rame  of  a  property  or  relation. 

The  model  begins  with  the  two  participants  In  the 
discussion,  the  robot  (named  :SHROLU),  and  the  person  (called 
:FRIEND}.  The  robot  has  a  hand  (:HAND),  and  manipulates  objects 
on  a  table  (tTABLE),  which  has  on  It  a  box  (:BOX).  The  rest  of 
the  physical  objects  are  toys  --  blocks,  pyramids,  and  balls. 

We  give  them  the  names  ;B1,  ;B2,  :B3,... 

Next  we  must  decide  on  the  set  of  concepts  we  will  use  to 
describe  these  objects  and  their  properties.  We  can  represent 
these  In  the  form  of  a  tree: 


t 

♦TABLE 

♦  BOX 

♦BLOCK 

#PHYS0B  — 

♦MAN IP - 

♦  BALL 

#ROBOT 

♦HAND 

♦PYRAMID 

♦PERSON 

♦STACK 

|♦COLOR 

♦PROPERTY- 

— 

1  ♦SHAPE 

Figure  54  —  Classification  of  Objects  and  Properties 

The  symbol  #PHYSOB  stands  for  ’’physical  object”,  and  #MANIP  for 
"manlpulable  object"  (l.e.  something  the  robot  can  pick  up). 
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We  could  use  these  as  simple  predicates#  and  have 
assertions  like  (IROBOT  tSHRDLU)#  (#HAND  ;HAND)#  and  (#PYRAMID 
:B5)  to  say  that  Shrdlu  Is  a  robot#  the  hand  Is  a  hand#  and  :B5 
Is  a  pyramid.  In  section  4.4.3#  wo  describe  the  way  the 
language  programs  choose  an  English  phrase  to  describe  an 
object.  In  order  to  do  so#  they  need  a  basic  noun  --  toe  one  we 
would  use  to  say  "this  Is  a  ...".  If  we  represented  the 
concepts  in  the  above  tree  using  simple  predicates#  and  then 
used  the  same  form  for  otner  piedlcates#  such  as  coiors  (for 
example#  (#BLUE  :85)),  the  language  generating  routines  would 
have  no  easy  way  to  know  which  was  the  "basic"  property.  It 
would  be  necessary  to  keep  lists  and  continually  check. 

Instead#  we  adopt  a  different  way  of  writing  these  concepts,  Vie 
use  the  concept  #15  to  mean  "has  as  Its  basic  description"#  and 
write  (#15  :5HRDLU  #ROBOT)#  (#15  ;H\ND  #HAN0)#  and  (#15  :B5 
#PYRAMID). 

Looking  at  the  tree#  we  see  that  the  properties  #PHY50B  and 
#MANIP  cannot  be  represented  In  this  fashion#  since  any  object 
having  them  also  has  a  basic  description.  We  therefore  write 
(#MANIP  :B5)  and  (#PHY50B  tTABLf). 

Next#  we  would  like  to  assign  physical  properties  to  these 
objects#  such  as  site#  shape#  color#  and  location.  Shape  and 
color  are  handled  wltn  simple  assertions  like  (#COLOR  ;B0X 
#WHITE)  and  (#SHAPE  :B5  #POINTED).  The  possible  snapes  are 
#R0UND#  #PO!MTED,  AND  #RECTANGULAR#  and  the  colors  are  #BLACK# 
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#RED/  #WHITE/  #GREEN,  and  #BLUE.  Of  course  It  would  Involve  no 
programming  to  Introduce  other  shape  or  color  names  --  all  that 
we  would  do  Is  use  them  In  an  assertion,  like  (#COLOR  :B11 
#MAUVE),  and  add  an  assertion  telling  what  type  of  thing  they 
are.  The  property  names  themselves  can  be  seen  as  objects,  and 
we  have  the  concepts  #COLOR  and  #SHAPE,  to  make  assertions  like 
(#IS  #BLUE  #C0L0R),  and  (#IS  #RECTA‘.GULAR  #SHAPE). 

Size  and  location  are  more  complex,  as  they  depend  on  the 
way  we  choose  to  represent  physical  space.  We  have  adopted  a 
standard  three-dimensional  coordinate  system,  with  coordinates 
ranging  from  0  to  1200  In  all  three  directions.  (The  number 
1200  was  chosen  for  convenience  in  programming  the  display). 

The  coordinate  point  (0  0  0)  Is  In  the  front  lower  left-hand 
corner  of  the  scene. 

We  have  made  the  simplifying  assumption  that  objects  are 
not  allowed  to  rotate,  and  therefore  always  keep  their 
orientation  aligned  with  the  coordinate  axes.  W®  can  represent 
the  position  of  an  object  by  giving  the  coordinates  of  Its  front 
lower  left-hand  corner,  and  can  specify  Its  size  by  giving  the 
three  dimensions.  We  use  the  symbols  #SIZE  and  #AT,  and  put  the 
coordinate  triples  as  a  single  element  In  the  assertions.  For 
example,  we  might  have  (#AT  :B5  (400  600  200}},  and  (#SIZE  :BS 
(100  100  300}}. 

Since  v/e  assume  that  the  robot  has  an  eye,  the  system 
begins  the  dialog  with  complete  Information  about  the  objects  In 
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the  scene/  their  shapes/  stzes/  colors/  and  locations.  In 
addition  to  the  PLANNER  assertions,  the  system  keeps  a  table  of 
sizes  and  locations  for  more  efficient  calculation  when  looking 
for  an  empty  space  to  set  something  down. 
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3.4.2  Re  1  at  I ons 

The  basic  relation?  v»-;  will  need  for  this  model  are  the 
spatial  relations  between  objects.  Since  we  are  Interested  In 
moving  objects  around  In  the  scene/  one  of  the  most  Important 
relations  Is  ^SUPPORT.  The  Initial  data  base  contains  all  of 
the  applicable  support  relations  for  the  Initial  scene#  and 
every  time  an  object  Is  moved#  an  antecedent  theorem  removes  the 
old  assertion  about  .hat  was  supporting  It#  and  puts  In  the 
correct  new  one.  We  have  adopted  a  very  simplified  notion  of 
support#  In  which  an  object  Is  supported  by  whatever  Is  directly 
below  Its  center  of  gravity#  at  the  level  of  Its  bottom  face. 

Therefore,  an  object  can  support  several  others#  but  there  Is 

only  one  thing  supporting  It.  Of  course  this  Is  an  extreme 

simplification  since  It  does  not  recognize  that  a  simple  bridge 

Is  supported.  If  this  program  were  to  be  adapted  to  use  with  an 

actual  robot#  a  much  more  general  Idea  of  support  would  be 

necessary.  Along  with  the  #SUPP0RT  relations#  we  keep  track  of 

the  property  #CLEARTOP.  The  assertion  <#CLEART0P  X)  will  be  In 

the  data  base  If  and  only  If  there  Is  no  assertion  (#SUPP0RT  X 

Y)  for  any  object  Y.  It  Is  also  kept  current  by  antecedent 

theorems  which  are  called  whenever  an  object  Is  moved.  This  ^ 

happens  automatically  whenever  an  assertion  of  the  form  (#AT  OBJ 

(X  Y  Z))  Is  made.  The  theorems  make  the  appropriate  check  to 

T 

see  whether  the  fCLEARTOP  status  of  any  object  has  changed#  and 
If  so  the  necessary  erasures  and  assertions  are  made. 


I 
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A  second  relation  which  Is  kept  In  the  data  base  Is 
^CONTAIN.  The  first  participant  must  be  the  box#  since  this  Is 
the  only  container  In  the  scene.  The  Information  about  what  Is 
container  in  the  box  Is  also  kept  current  by  an  antecedent 
theorem.  The  relation  #GRASPING  Is  used  to  Indicate  what 
object  (If  any)  the  robot's  hand  Is  grasping.  It  Is 
theoretically  a  two-place  predicate#  relating  a  grasper  and  a 
graspee#  as  In  (^GRASPING  :SHRDLU  :S2).  Since  there  Is  only  one 
hand  In  cur  scene#  It  Is  clear  who  must  be  doing  the  grasping# 
so  the  assertion  Is  reduced  to  (^GRASPING  ;B2). 

The  other  relation  which  Is  stored  In  the  data  base  Is  the 
fPART  relation  between  an  object  and  a  stack.  We  can  give  a 
name  to  a  stack#  such  as  ;S1#  and  assert  (#PART  :B2  ;S1).  As 
objects  are  moved#  the  changes  to  the  data  base  are  again  made 
automatically  by  antecedent  theorems  which  notice  changes  of 
location. 

As  we  explained  In  section  3.3.3#  we  must  decide  what 
relations  are  useful  enough  to  occupy  space  In  our  data  base# 
and  which  should  be  recomputed  from  simpler  Information  each 
time  we  need  them.  VJe  have  Included  relations  like  ^SUPPORT  and 
ICONTAIN  because  they  are  often  referenced  In  deciding  how  to 
move  objects.  We  can  think  of  other  relations#  such  as  the 
relative  position  of  two  objects#  which  can  be  computed  from 
their  locations#  and  are  not  used  often  enough  to  be  worth 
keeping  In  the  data  base  and  partially  recomputing  every  time 
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something  Is  moved.  We  represent  these  relations  using  the 
symbols  #RIGHT,  #BEHIND,  and  #ABOVE.  (These  represent  the 
direction  of  the  positive  coordinate  axis  for  X,  Y,  and  Z 
respectively).  We  do  not  need  the  converse  relations/  since  we 
can  represent  a  fact  like  *':B1  is  below  :B2"  by  (#ABOVE  !B2 
:B1)/  and  our  semantic  system  can  convert  what  is  said  to  this 
standard  format.  The  symbol  #0N  is  used  to  represent  the 
transitive  closure  of  #SUPPORT.  That  Is,  Z  Is  #0N  A  If  A 
supports  B/  B  sup;'orts  C, ...  supports  Z. 

The  three  spatial  relations  use  o  common  consecuent  theorem 
called  TC-LOC  which  decides  If  they  are  true  by  looking  at  the 
coordinates  and  sizes  of  the  objects.  The  #0N  relation  has  a 
consequent  theorem  TC-ON  which  looks  for  chains  of  support, 
(notice  that  the  prefix  TC-  stands  for  Theorem  Consequent/  and 
Is  attached  to  all  of  our  consequent  theorems.  Similarly/  TA- 
and  TE-  are  used  for  antecedent  and  erasing  theorems.) 

The  measurements  of  #HEIGHT/  #WIDTH/  and  i?LENGTH  are 
represented  as  a  simple  assertion/  like  (#HEIGHT  ;B3  100),  but 
they  are  not  stored  In  the  data  base.  They  are  computed  when 
needed  from  the  #SIZE  assertion,  and  can  be  accessed  by  using 
the  theorem  TC-MEASURE,  or  by  using  a  functional  rotation.  The 
expression  (#HEIGHT  X)  evaluates  to  the  heignt  of  whatever 
object  the  variable  X  Is  bound  to.  If  #S!ZE  Is  used  In  this 
way.  It  returns  a  measure  of  "overall  size”  to  be  used  for 
comparisons  like  "bigger".  Currently  It  returns  the  sum  of  the 
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X,  Y,  and  Z  coordinates,  but  It  could  be  easily  changed  to 
more  In  accord  with  human  psychology. 

In  order  to  compare  measurements,  we  have  the  relation 
#M0RE.  The  sentence  '*;B1  Is  shorter  than  !82"  Is  equivalent  to 
the  c_aertlon  (♦MORE  #HEIGHT  :B2  :B1).  Again,  we  do  not  need 
the  relation  "less"  since  we  can  simply  reverse  the  order  of  the 
objects.  The  relation  #ASMUCH  Is  used  In  the  same  way,  to 
express  "greater  than  or  equal".  Instead  of  "strictly  greater 
than".  None  of  these  assertions  are  stored  (If  we  have  ten 
objects,  there  will  be  almost  400  relationships),  but  are 
computed  from  more  basic  Information  as  they  are  needed. 

One  final  relationship  Is  #LIKE,  which  relates  a  person  or 
robot  to  any  object.  There  Is  a  theorem  which  shows  that  the 
robot  likes  everything,  but  knowledge  about  what  the  human  user 
likes  Is  gathered  from  his  statements.  The  semantic  programs 
can  use  statements  about  liking  to  generate  further  PLANNER 
theorems  which  are  used  to  answer  questions  about  what  : FRIEND 
1  ikes. 


m 
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3.4.3  Actions 

The  only  events  that  can  take  place  In  our  world  are 
actions  taken  by  the  robot  In  moving  Its  hand  and  manipulating 
objects.  At  the  most  basic  level,  there  are  only  three  actions 
v/hlch  can  occur  --  MOVETO,  GRASP,  and  UNGRASP.  These  are  the 
actual  commands  sen"  to  the  display  routines,  and  could 
theoretically  be  sent  directly  to  a  physical  robot  system. 

The  result  of  calling  a  consequent  theorem  to  achieve  a 
goal  requiring  r.-.otlon,  like  (#PUTON  ;B3  :84),  Is  a  plan  --  a 
list  of  Instructions  using  the  three  elementary  functions. 

MOVETO  moves  the  hand  and  whatever  It  Is  currently  grasping  to  a 
set  of  specified  coordinates.  GRASP  sets  an  Indicator  that  the 
grasped  object  s  to  be  moved  along  with  the  hand,  and  UNGRASP 
unsets  It.  The  robot  grasps  by  moving  Its  hand  directly  over 
the  center  of  the  object  on  Its  top  surface,  and  turning  on  a 
"magnet".  It  can  do  this  to  any  manipulable  object,  but  can 
only  grasp  one  thing  at  a  time.  Using  these  elementary  actions, 
we  can  build  a  hierarchy  of  actions.  Including  goals  which  may 
Involve  a  whole  sequence  of  deductions  and  actions,  like 
#STACKUP. 

The  semantic  programs  never  need  to  worry  about  details 
Involving  physical  coordinates  or  specific  motion  Instructions, 
but  can  produce  Input  for  higher-level  theorems  which  do  the 
deta 1 1 ed  work. 

f 

At  a  slightly  higher  level,  we  have  the  PLANNER  concepts 
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♦MOVEHANU,  #GRASP  and  fUNGRASP,  and  corresponding  consequent 
theorems  to  achieve  them.  There  Is  a  significant  difference 
between  these  and  the  functions  listed  above.  Calling  tne 
function  MOVETO  actually  causes  the  hand  to  move.  On  the  other 
hand,  when  PLANNER  evaluates  a  statment  like; 

(THG0AL(#M0VEHAN0  (600  200  300))(THUSE  TC-MOVEHAND ) ) 
nothing  is  actually  moved.  The  theorem  TC-MOVEHAND  is  called, 
and  It  creates  a  plan  to  do  the  motion,  but  If  this  move  causes 
us  to  be  unable  to  achieve  a  goal  at  some  later  point,  the 
PLANNER  backup  mechanism  will  automatical y  erase  It  from  the 
plan.  The  robot  plans  the  entire  action  before  actually  moving 
anything,  trying  all  of  the  means  It  has  to  achieve  Its  goal. 

The  theorems  also  do  some  checking  to  see  If  we  are  trying 
to  do  something  impossible.  For  example,  TC-MOVEHAND  makes  sure 
the  action  would  not  Involve  placing  a  block  where  there  Is 
already  an  object,  and  TC-UNGRASP  falls  unless  there  Is 
something  supporting  the  object  It  wants  to  let  go  of. 
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3.4.4  Carrying  Otit  Commands 

Some  theorems#  like  TC-GRASP#  are  more  complex#  as  they  can 
cause  a  series  of  actions.  In  this  section  we  will  fallov/ 
PLANNER  through  such  an  action,  using  the  simplified  theorems  of 
figure  55.  If  PLANNER  tries  the  goal: 

(THGOAL  (#GRASP  :B1)(THUSE  TC-GRASP)) 
the  theorem  TC-GRASP  can  do  a  number  of  things.  It  checks  to 
make  sure  :B1  Is  a  graspable  object  by  looking  In  the  data  base 
for  (#MANIP  ;B1).  If  the  hand  is  already  grasping  the  object# 

It  has  nothing  more  to  do.  If  not#  It  must  first  get  Che  hand 
to  the  obiecL.  This  may  Involve  complications  --  the  hand  may 
already  be  holding  something#  or  there  may  be  objects  sitting  on 
top  of  the  one  It  wants  to  grasp.  In  the  first  case#  It  must 
get  rid  of  whatever  Is  In  the  hand#  using  the  the  command  #GET- 
RID-OF.  The  easiest  way  to  get  rid  of  something  Is  to  set  It 
on  the  table#  so  TC-GET-RI D-OF  creates  the  goal  (#PUTCN  $?X 
:TABLE)#  where  the  variable  $?X  Is  bound  to  the  object  the  hand 
s  holding.  TC-PUTON  must  In  turn  find  a  big  enough  empty  place 
to  set  down  Its  burden#  using  the  command  #FINDSPACE#  wnich 
performs  the  necessary  calculations#  using  Information  about  the 
sizes  and  locations  of  all  the  objects.  TC-PUTON  then  creates  a 
goal  using  #PUT#  which  calculates  where  the  hand  must  be  moved 
to  get  the  object  Into  the  desrired  place#  then  calls  #MOVEHAND 
to  actually  plan  the  move.  If  we  look  at  the  logical  structure 
of  our  active  goals  at  this  point#  assuming  that  we  want  to 
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(DEFTHEOREM  TC-CUEARTOP 

(THCONSE  (X  Y)  (fCLEARTOP  $?X) 

GO  (THCOND  ((THGOAL  (#SUPP0RT  $?X  $_Y>) 
(THGOAL  (#GET-RID-OF  $7Y) 

(THOSE  TC-GET-RIO-OF)) 
(THGO  GO)) 

((THASSERT  (#CLEARTOP  $?X)))))) 


(DEFTHEOREM  TC-GET-R I D-OF 

(THCONSE  (X  Y)  (#GET-RID-OF  $?X) 

(THOR 

(THGOAL  (#PUTON  $?X  :TABLE) (THOSE  TC-PUTON)) 
(THGOAL  (#PUTOM  $?X  $?Y) (THOSE  TC-POTON) ) ) ) ) 

(DEFTHEOREM  TC-GRASP 

(THCONSE  (X  Y)  (#GRASP  $?X) 

( THGOAL (#MAN IP  $?X)) 

(THCOND  ((THGOAL  (#GRASPING  $7X))) 

((THGOAL  (#GRASPING  $_Y)) 

(THGOAL  (#GET-RID-OF  $?Y) 

(THOSE  TC-GET-RIO-OF)))) 

(T)) 

(THGOAL  (#CLEARTOP  $?X)  (THOSE  TC-CLEARTOP) ) 
(THSETQ  $_Y  (TOPCENTER  $?X:> 

(THGOAL  (#MOVEHAND  $7Y) 

(THOSE  TC-MOVEHANO)) 

(THASSERT  (#GRASPING  $?X)))) 


(DEFTHEOREM  TC-POT 

(THCONSE  (X  Y  Z)  (#POT  $?X  $?Y) 

(CLEAR  $?Y  (SIZE  $7X)  $?X) 

(SOPPORT  $7Y  (SIZE  $?X)  $?X) 

(THGOAL  (#GRASP  $?X)  (THOSE  TC-GR.A3P)) 

(THSETQ  $_Z  (TCENT  S7Y  (SIZE  $?X)); 

(THGOAL  (#MOVEHAND  $?Z)  (THOSE  TC-MOVEHAND) ) 
(THGOAL  (#ONGRASP)  (THOSE  TC-ONQRASP) ) ) ) 

(DEFTHEOREM  TC-POTUN 

(THCONSE  (X  Y  Z)  (#POTON  $?X  $?Y) 

(NOT  (EQ  $7X  $?Y)) 

(THGOAL  (#FINDSPACE  $?Y  $E  (SIZE  $?X)  $?X  $_Z) 
(THOSE  TC-FINDSPACE  TC-MAKESPACE) ) 
(THGOAL  (#POT  $?X  $?Z)  (THOSE  TC-POT)))) 

Figure  55  —  Simplified  PLANNER  Theor'ems 
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grasp  :B1/  but  were  already  grasping  :B2/  we  see: 

(#GRASP  :B1) 

(#GET-RID-0F  :B2> 

(i»PUT0N  :B2  STABLE) 

(#PUT  :B2  (453  201  0)) 

(#M0VEHAMD  (553  301  lOO) 

After  moving;.  TC-PUT  calls  #UNGRASP/  and  we  have  achieved 
the  first  part  of  our  original  goal  --  emptying  the  hand.  Now 
we  must  clear  off  the  block  we  want  to  grasp.  TC-GRASP  sets  up 
the  goal : 

(THG0AL(#CLEART0P  :B2)(7HUSE  TC-CLEARTOP) ) 

This  Is  a  good  exe'mple  of  the  double  use  of  PLANNER  goals  to 
both  search  the  data  base  and  carry  out  actions.  If  the 
assertion  (#CLEART0P  :B1)  Is  present.  It  satisfies  this  goal 
Immediately  wlthou.  calling  the  theorem.  However  If  :B1  Is  not 
already  clear,  this  THGOAL  statement  calls  TC-CLEARTOP  which 
takes  the  necessary  actions. 

TC-CLEARTOP  will  try  to  #GET-RID-OF  the  objects  on  top  of 
:B1.  This  will  In  turn  use  fPUTON,  which  uses  fPUT.  But  TC-PUT 
may  have  more  to  do  this  time,  since  the  hand  Is  not  already 
grasping  the  object  It  has  to  move.  It  therefore  sets  up  a  goal 
to  #GRASP  the  object,  calling  TC-GRASP.  We  have  gone  full 
circle,  and  are  back  In  the  theorem  we  started  In.  Of  course 
this  Is  no  problem  In  a  recursive  language,  and  the  process 
continues  as  It  should  until  all  of  the  goals  have  been 
achieved,  or  all  of  the  methods  Included  In  the  theorems  have 
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VJe  have  gone  through  this  example  In  some  detail  to  give  a 
feeling  for  the  goa 1 -or I entcd  programs  used  by  PLANNER.  The 
programs  are  highly  recursive,  with  a  clear  subgoal  structure, 
and  with  theorems  often  calling  themselves  to  achieve  subgoals. 
During  all  of  this,  PLANNER  Is  keeping  track  of  what  Is  being 
done  In  such  a  way  that  It  can  b^ck  up  and  try  something 
different  If  necessary.  For  example.  If  TC-GET-R I D-OF  puts  an 
object  on  the  table,  and  It  later  Is  In  the  way  of  something 
which  must  be  done,  a  failure  will  propagate  back  to  TC-GET-RID- 
OF,  and  I t  will  try  putting  It  somewhere  else. 

Figure  56  lists  the  different  action  concepts  more 
systematically,  showing  the  form  of  the  goal  statements,  and  the 
actions  taken  by  the  theorems  corresponding  to  them.  In  our 
program,  the  names  of  the  theorems  are  formed  by  adding  the 
prefix  TC-  for  a  consequent  theorm,  TA-  for  antecedent,  and  TE- 
for  erasing.  This  is  strictly  for  programmer  convenience  In 
recognizing  the  purpose  of  a  theorem  from  Its  name.  All  of 
these  goals  can  be  used  Internally  within  the  BLOCKS  system,  and 
most  of  them  can  be  called  directly  by  the  semantic  programs,  as 
direct  translations  of  English  commands.  Some,  like  #M0VEHAND 
cannot  be  called  by  the  linguistic  programs,  since  the  semantic 
routines  do  not  Include  a  way  to  specify  exact  coordinates  In 
English. 


Command 
(#M0VEHAND  (X 

(#UNGRASP) 

(#GRASP  X) 

(#PUT  W  (X  Y 

(#RAISEHAND) 

(#PICKUP  X) 
(#PUTON  X  Y) 

(#PUTIN  X  Y) 

(#GET-R!D-0F 

(#CLEART0P  X) 
(#STACKUP  (X 

(#FINDSPACE  A 


Section  3.4.4  -  Page  275 


Effect 


Y  Z))  f'iove  the  center  of  the  hand  to  location 
(X  Y  Z) .  Anything  being  grasped  goes 
along  automatically. 

Let  go  of  whatever  the  hand  Is  holding. 
Falls  If  the  object  Is  not  supported. 

Grasp  object  X,  doing  any  manipulations 
necessary  to  get  to  It. 

Z))  Put  VI  at  location  (X  Y  Z).  Falls  unless 
the  space  Is  empty. 

Raise  the  hand  and  whatever  It  holds  as 
high  as  1 t  wl 1 1  go. 

Grasp  X  and  raise  the  hand. 

Put  object  X  on  object  Y.  If  there  Is 
not  enough  free  space  on  Y,  move 
objects  to  make  It. 

The  same  as  #PUT0N,  except  that  Y  must  be 
a  box,  and  different  methods  are  used 
to  find  space. 

X)  1 ry  to  put  X  on  the  table,  and  If  that 
fails  put  It  on  anything  else. 

Get  rid  of  everything  which  Is  on  X. 

Y...))  Stack  X,  Y,  ...  on  top  of  each  other. 

In  order  of  size. 

(X  Y  Z)  B  $_C) 

This  goal  can  be  achieved  by  two  different 
theorems.  TC-FINDSPACE  tries  to  find  a 
space  of  size  (X  Y  Z)  on  top  of  object 
A,  counting  any  space  occupied  by  B  as 
empty.  $_C  Is  a  variable  binding  used 
to  return  the  answer.  If  this  falls, 
TC-MA.KESPACE  can  create  the  space  by 
moving  objects. 


Figure  56  --  BLOCKS  Commands 
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3.4.5  Memory 

In  order  to  answer  questions  about  past  events,  the  BLOCKS 
programs  remember  selected  parts  of  their  subgoal  tree.  Vhey  do 
this  by  creating  objects  called  events,  and  putting  th®m  on  an 
EVENTLIST.  The  system  does  not  remember  the  detailed  series  of 
specific  steps  like  #MOVEHAND,  but  keeps  track  of  the  larger 
goals  like  #PUT0N  and  #STACKUP.  The  time  of  events  Is  measured 
by  a  clock  which  starts  at  0  and  Is  Incremented  by  1  every  time 
any  motion  occurs.  The  theorems  which  want  to  be  remembered  use 
the  functions  MEMORY  and  MEMOREND,  calling  MEMORY  when  the 
theorem  Is  entered  and  MEMOREND  when  It  exits.  MEMOREND  causes 
an  event  to  be  created,  combining  the  original  goal  statement 
with  an  arbitrary  name  (chosen  from  El,  E2,...).  Recall  from 
Section  3.1  that  a  relation  can  Include  Its  own  name,  so  that* 
other  relations  can  refer  to  It.  If  we  call  TC-PUTON  with  the 
goal  (#PUTON  $?X  $?Y),  with  the  variables  X  and  Y  bound  to  :B1 
and  :B2  respectively,  the  resulting  event  which  is  put  Into  the 
data  base  Is  (fPUTON  El  :B1  :B2).  The  event  name  is  second. 
Instead  of  last  as  described  in  3.1  for  unimportant  technical 
reasons  which  will  be  changed  In  later  versions. 

In  addition  to  putting  this  assertion  In  the  data  base, 
MEMOREND  puts  Information  on  the  property  list  of  the  event  name 
--  the  starting  time,  ending  time,  and  reason  for  each  event. 

The  reason  Is  the  name  of  the  event  nearest  up  In  the  subgoal 
tree  which  is  being  remembered.  The  reason  for  goals  called  by 
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the  linguistic  part  of  the  system  Is  a  special  symbol  meaning 
"because  you  asked  me  to".  MEMORY  Is  called  at  the  beginning  of 

♦ 

a  theorem  to  establish  the  start  time  and  declare  that  theorem 
as  the  "reason"  for  the  subgoals  It  calls. 

A  second  kind  of  memory  keeps  track  of  the  actual  physical  * 

motions  of  objects,  noting  each  time  one  Is  moved,  and  recording 
Its  name  and  the  location  It  went  to.  This  list  can  be  used  to 
establish  where  any  object  was  at  any  past  time. 

When  we  want  to  pick  up  block  ;81,  we  can  say: 

(THGOAL( #P I CKUP  :B1}),  and  It  Is  Interpreted  as  a  command.  How 
can  we  ask  "Old  you  pick  up  :B1?"?  When  the  robot  picked  It  up, 
an  assertion  like  (#PICKUP  E2  :B1)  was  stored  in  the  data  base. 

Therefore  If  we  ask  PLANNER 
(THPROG(X) 

vTHGOAL  (fPlCKUP  $?X  .'81))) 

It  will  find  the  assertion,  binding  the  variable  X  to  the  event 
name  E2.  Since  tha  property  list  of  E2  gives  Its  sta^’tlng  and 
ending  times,  and  Its  reason,  this  Is  sufficient  Information  to 
answer  most  questions. 

If  we  want  to  ask  something  like  "Old  you  pick  up  :B1 
before  you  built  the  stack?"  we  need  some  way  to  look  for 
particular  time  Intervals.  This  Is  done  by  using  a  modified 
version  of  the  event  description.  Including  a  time  Indicator. 

The  exact  form  of  the  time  Indicator  Is  described  In  the  section 
on  semantics,  but  the  way  It  Is  used  to  establish  a  goal  Is: 
(THGOAL(#PICKUP  $?X  :B1  $?TIME) (THUSE  TCTE-PIGKUP) ) 
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The  prefix  TCTE-  on  the  name  of  a  theorem  means  that  It 
includes  a  time  and  an  event  name.  Ordinarily  when  such  a 
theorem  Is  entered,  the  variable  TIME  would  have  a  value,  while 
the  variable  X  would  not.  The  theorem  looks  through  the  data 
base  for  stored  events  of  the  form  (#PICKUP  $?X  :B1)  and  checks 
them  to  see  If  they  agree  with  the  time  TIME. 

For  some  events,  like  #PLIT0N,  this  Is  sufficient  since  the 
system  remembers  every  #PUTON  It  does.  For  others,  like  fPICKUP 
less  Information  Is  kept.  When  #PICKUP  Is  called  as  a  goal  at 
the  top  level.  It  Is  remembered.  But  the  system  doss  not 
remember  each  time  something  was  picked  up  In  the  course  of 
moving  the  toys  around.  The  fact  that  a  block  was  picked  up  can 
be  deduced  from  the  fact  that  It  was  put  somewhere,  and  the 
theorem  TCTE-PICKUP  actually  looks  at  a  number  of  different 
types  of  events  (like  #PUT0N  and  #PIITIN)  to  find  all  the 
occasions  on  which  an  object  was  really  picked  up. 

For  spatial  relations,  we  also  need  to  be  able  to  Include 
time,  fer  example,  "Was  the  block  behind  the  pyramid  before...?'* 
In  this  case,  no  assertions  are  stored,  since  the  memory  of 
motion  events  Is  sufficient  to  reconstruct  the  scene.  There  are 
special  theorems  with  the  prefix  TCT-  which  try  to  verify  a 
relation  with  a  time  condition.  For  example,  we  can  ask  "Is  :B1 
on  :B2?"  with  the  goal 

(THGOAL(#ON  ;B1  :B2)(THUSE  TC-ON)) 

To  ask  "Was  :B1  on  :B2  before...?"  we  bind  the  variable  TIME  to 
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the  representation  of  the  time  we  are  Interested  !n,  and  ask 
(THG0AL(#0N  ;B1  ;B2  $?TIME) (THUSc  TCT-ON)) 


The  theorem  TCT-ON  Is  the  same  as  TC-ON  except  that  It  deals 
with  the  specified  time  Instead  of  the  present.  Similar  TCT- 
theorems  exist  for  all  of  the  spatial  relations/  and  for 
properties  which  change  In  time/  such  as  #CLEARTOP  and  #AT. 

Appendix  C  Is  a  listing  of  parts  of  the  BLOCKS  program  as 
It  was  used  In  generating  the  sample  dialog.  PLANNER  theorems 
are  sufficiently  self-explanatory  that  only  a  few  comments  have 
been  Included.  The  auxllllary  LISP  functions  (such  as  the  one 
which  decides  whether  a  space  Is  empty)  are  not  Included/  but 
are  available  on  request,  along  with  the  complete  BLOCKS 
1 1  St  I ngs. 
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Chapter  4.  Semantics 


4.1  What  Is  Semantics? 

-It. 1.1  Jihs.  Pro'!' I  nee  M  Semantics 

The  field  of  semantics  has  always  been  a  hazy  swampland. 
There  Is  little  agreement  among  "semant I c I sts"  where  Its  borders 
lie  or  what  the  terrain  looks  like.  Logicians/  philosophers/ 
and  linguists  all  approach  It  with  the  tools  of  their  own  trade/ 
and  the  problem  of  just  defining  "semantics”  and  "meaning"  have 
occupied  volumes  of  debate. 

In  trying  to  program  computers  to  understand  natural 
language/  It  has  been  necessary  to  have  a  more  explicit  and 
complete  notion  of  semantics.  The  attempts  at  writing  language 
understanding  programs  have  made  It  more  clear  just  what  a 
semantic  theory  has  to  do,  and  how  It  must  connect  wl^h  the 
syntactic  and  logical  aspects  of  language.  In  practical  termS/ 
we  need  a  transducer  which  can  work  with  a  syntactic  analysis/ 
and  produce  data  which  Is  acceptable  to  a  logical  deductive 
system. 

In  the  preceding  chapters  we  have  described  the  two  ends  of 
a  language  system  --  a  syntactic  parser  with  a  grammar  of 
English/  and  a  deductive  system  with  a  base  of  knowledge  about  a 
particular  subject.  What  does  our  semantic  theory  have  to  do  to 
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fill  the  gap? 

In  section  3.1.2  we  outlined  the  basis  for  a  theory  of 
semantics.  It  Includes  a  world  of  "concepts"  and  structures  of 
concepts  which  are  postulated  by  the  linguist  In  trying  to 
explain  linguistic  phenomena.  These  are  not  a  psychological 
reality#  but  a  formalism  In  which  he  can  syrtematical ly  express 
those  aspects  of  meaning  which  are  relevant  to  language  use.  By 
manipulating  structures  In  this  formalism  as  a  part  of  analyzing 
sentences  in  natural  language#  the  theory  can  directly  deal  with 
problems  of  relating  meaning  to  parts  of  the  speaker's  and 
hearer's  knowledge  which  are  not  mentioned  explicitly  In  the 
sentence  being  analyzed. 

A  semantic  theory  must  describe  the  relationship  between 
the  words  and  syntactic  structures  of  natural  language  and  the 
postulated  formalism  of  concepts  and  operations  on  concepts.  In 
our  theory#  this  relationship  Is  described  as  a  set  of 
procedures  which  analyze  linguistic  forms  to  produce 
representations  of  meaning  In  the  Internal  conceptual  formalism. 
Just  as  with  the  grammar#  this  does  not  purport  to  be  a  model  of 
an  actual  process  taking  place  In  the  hearer  or  speaker.  The 
process  description  Is  used  because  It  Is  a  powerful  way  to 
describe  "neutral"  relationships#  as  well  as  being 
psychologically  suggestive. 

The  theory  must  describe  relationships  at  three  different 
levels.  First#  there  must  be  a  way  to  define  the  meanings  of 
words.  We  pointed  out  In  the  section  on  "meaning"  (section 
3,1)  that  the  real  "meaning"  of  a  word  or  concept  cannot  be 
defined  In  simple  dictionary  terms#  but  Involves  Its 
relationship  to  an  entire  vocabulary  and  structure  of  concepts. 
However#  we  can  talk  about  the  formal  description  attached  to  a 
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word  v;hlch  allows  It  to  be  Integrated  Into  the  systen,.  In  the 
rest  of  this  chapter,  we  will  use  the  word  "meaning”  In  this 
more  limited  sense,  describing  those  formal  aspects  of  the 
meaning  of  a  word  (or  syntactic  construction)  which  are  attached 
to  It  as  Its  dictionary  definition. 

The  formalism  for  definitions  should  not  depend  on  the 
details  of  the  semantic  programs,  but  should  allow  users  to  add 
to  the  vocabulary  In  a  simple  and  natural  way.  It  should  also 
be  possible  to  handle  the  quirks  and  Idlosyncracles  of  meaning 
which  words  can  have.  Instead  of  limiting  ourselves  to  "well- 
behaved"  standard  words. 

At  the  next  level  we  must  relate  the  meanings  of  the  words 
In  a  sentence  to  each  other  and  to  the  meaning  of  the  syntactic 
structures.  We  need  an  analysis  of  the  ways  In  which  English 
structures  are  designed  to  convey  meaning,  and  what  role  the 
different  words  and  syntactic  features  play  In  this  meaning. 

Finally,  a  sentence  In  natural  language  Is  never 
Interpreted  In  Isolation.  It  Is  always  part  of  a  context.*  and 
Its  meaning  Is  dependent  on  that  context.  A  theory  should 
explain  the  different  v/ays  In  which  the  "setting"  of  a  sentence 
can  affect  Its  meaning.  It  must  deal  both  with  the  linguistic 
setting  (the  context  within  the  discourse)  and  the  real-world 
setting  (the  way  meaning  Interacts  with  knowledge  of  non- 
1 Ingul St  I c  facts . ) 


i«3>.2  IM  SAPgntlg  g.ys.i;gfn 
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With  definite  goals  In  mind  for  a  semantic  system,  we  can 
consider  how  to  Implement  It.  First  let  us  look  at  what  It 
should  know  about  English.  As  we  have  been  emphasizing 
throughout  the  paper,  a  language  Is  not  a  set  of  abstract 
symbols.  It  Isa  system  for  conveying  meaning,  and  has  evolved 
with  very  special  mechanisms  for  conveying  just  those  aspects  of 
meaning  needed  for  human  communication. 

Section  3,1  discussed  the  person's  "model  of  the  world" 
which  Is  organized  around  notions  of  "objects",  having 
"properties"  and  entering  Into  "relationships,"  In  3.1.3,  these 
are  combined  to  form  more  complicated  logical  expressions. 
Looking  at  the  properties  of  English  syntax  (as  described  In 
Section  2.3}  we  see  that  these  basic  elements  of  the  "world 
model"  are  just  what  English  Is  good  at  conveying. 

For  describing  objects,  there  Is  the  NOUN  GROUP,  It 
contains  a  noun,  which  Indicates  the  kind  of  object;  adjectives 
and  classifiers,  which  describe  further  properties  of  the 
object;  and  a  complex  system  of  quantifiers  and  determiners 
describing  Its  logical  status  --  whether  It  Is  a  particular 
object,  ("the  sun"),  a  class  of  objects  ("people"),  a 
particular  set  of  objects  ("John’s  lizards"),  an  unspecified  set 
containing  a  specified  number  of  objects  ("three  bananas"),  etc. 
The  details  (described  In  section  4.2)  are  complex,  but  the 
Important  thing  Is  the  existence  of  a  systematic  structure. 


1 
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For  describing  reiationships  and  events^  there  are  the 
CLAUSE^  PREPOSITION  GROUP,  and  ADJECTIVE  GROUP.  The  CLAUSE  is 
especially  suited  for  dealing  with  reiationships  having  a 
particular  time  reference,  working  in  coordination  with  the  VERB 
GROUP,  which  functions  to  convey  Information  about  time,  using 
an  ingenious  system  of  tenses.  Clauses  can  also  be  used  to 
represent  an  event  or  relationship  as  an  object  (as  in  ”H i s 
going  pleased  me.”),  or  to  modify  a  particular  object  within  a 
NOUN  GROUP  (In  "the  man  who  broke  the  bank”) .  The  PREPG  Is  a 
less  flexible  and  simpler  way  of  expressing  relationships  which 
do  not  need  modifiers  such  as  time,  place,  and  manner  (such  as 
"the  man  Jji  the  bl ue  vest”) .  The  ADJG  Is  used  in  some 
constructions  to  describe  properties  and  some  special  kinds  of 
relationships  of  objects  (such  as  "Her  gift  was  bigger  than  a 
breadbox.”) 

The  semantic  system  Is  built  around  a  group  of  about  a 
dozen  programs  which  are  experts  at  looking  at  these  particular 
syntactic  structures.  They  look  at  both  the  structures  and  the 
meanings  of  the  words  to  build  up  PLANNER  expressions  which  will 
be  used  by  the  deductive  mechanism.  It  is  important  to  remember 
that  the  parser  uses  systemic  grammar  so  the  semantic  programs 
can  look  directly  for  features  such  as  PASSIVE  or  PLURAL  or 
QUESTION  to  make  decisions  about  the  meaning  of  the  sentence  or 
phrase. 


Since  each  of  these  semantic  "specialists”  can  work 
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separately^  there  Is  no  need  to  wait  for  a  complete  parsing 
before  beginning  semantic  analysts.  The  NOUN  GROUP  specialist 
can  be  called  as  soon  as  a  NOUN  GROUP  has  been  parsed/  to  see 
whether  It  makes  sense  before  the  parser  goes  on.  In  fact,  the 
task  can  be  broken  up,  and  a  preliminary  NOUN  GROUP  specialist 
can  be  called  In  the  middle  of  parsing  (for  example,  after 
finding  the  noun  and  adjectives,  but  before  looking  for 
modifying  clauses  or  prepositional  phrases)  to  see  whether  It  is 
worth  continuing,  or  whether  the  supposed  combination  of 
adjectives  and  noun  Is  nonsensical.  The  grammar  Is  in  the  form 
of  a  program,  so  it  Is  just  as  easy  to  call  a  semantic  routine 
at  any  time  as  a  syntactic  one.  Any  semantic  program  has  full 
power  to  use  the  deductive  system,  and  can  even  call  the  grammar 
to  do  a  special  bit  of  parsing  before  going  on  with  the  semantic 
analysis.  For  this  reason  it  is  very  hard  to  classify  the 
semantic  analysis  as  ”top-down”  or  "bottom-up".  In  general  each 
piece  of  the  structure  Is  analyzed  as  It  Is  parsed,  which  Is  a 
bottom-up  approach.  However  whenever  there  Is  a  reason  to  delay 
a  part  of  the  analysis  until  some  of  the  larger  structure  has 
been  analyzed,  it  Is  just  as  easy  to  write  the  semantic 
specialist  programs  In  this  top-down  manner.  In  our  system  both 
approaches  are  used. 
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Kar<l^ 

A  semantic  system  needs  to  deal  with  two  different  kinds  of 
words.  Some  words  are  Included  In  the  general  knowledge  of  the 
English  language.  Words  like  "that'*  or  "than".  In  "He  knew  that 
they  were  madder  than  hornets^"  would  be  difficult  to  define 
except  In  terms  of  their  place  In  the  sentence  structure.  They 
are  being  used  as  signals  of  certain  syntactic  structures  and 
features,  and  have  no  meaning  except  for  this  signalling  (which 
Is  recognized  by  the  grammar).  These  are  often  called  "function 
words"  In  distinction  to  the  "content  words"  which  make  up  the 
bulk  of  ojr  vocabulary.  This  Is  not  a  shcrp  distinction,  since 
many  words  serve  a  combination  of  purposes  (for  example,  numbers 
are  basically  "function  words",  but  each  one  has  Its  unique 
meaning).  We  can  generally  distinguish  between  words  like 
"that"  and  "than"  whose  meanings  are  built  Into  the  system,  and 
words  like  "snake",  "under",  and  "walk",  which  surely  are  not. 

The  definitions  of  content  words  should  not  have  to  Include 
"expert"  knowledge  about  the  semantics  or  grammar  of  the 
language.  In  defining  the  word  "mighty",  we  should  not  have  to 
worry  about  whether  It  appears  In  "The  sword  Is  mighty,"  or 
"the  mightiest  warrior",  or  "a  man  mightier  than  a  locomotive," 
We  should  be  able  to  say  "'Mighty'  means  having  the  property 
represented  conceptually  as  #MIGHT.",  and  let  the  semantic 
system  do  the  rest. 

We  need  a  semantic  1 anguaee  for  expressing  definitions  In  a 
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way  which  does  not  depend  on  the  grammar  or  the  particular 
semantic  programs.  Each  of  our  “specialists"  which  looks  at  the 
meanings  of  words  should  be  able  to  Interpret  those  statements 
In  the  semantic  language  which  might  be  relevant  to  Its  job. 

Section  4.2  describes  simple  formats  for  defining  common 
verbs,  nouns,  adjectives,  and  prepositions,  and  In  fact, 
these  definitions  do  not  look  much  like  programs  at  all.  Why 
then  do  we  call  this  a  "language"  Instead  of  saying  that  wa  have 
a  set  of  special  formats  for  defining  words?  The  distinction 
becomes  Important  for  all  of  the  Irregular  cases  and  the 
I diosyncrac les  that  words  can  have.  For  example.  In  "The  block 
Is  on  the  roof  of  the  car.",  "the  roof  of  the  car"  is  a  NG 
referring  to  a  particular  object  which  Is  a  roof.  But  If  we  say 
"The  block  Is  on  the  right  of  the  box",  we  “'•e  not  referring  to 
a  particular  object  which  Is  a  "right".  The  normal  NG  mechanism 
for  describing  objects  Is  being  used  Instead  to  describe  a 
relationship  between  the  block  and  the  box.  We  could  reprogram 
our  NOUN  GROUP  semantic  specialist  to  recognize  this  special 
case  and  treat  It  differently,  but  this  Is  a  path  leading  to  a 
roadblock.  We  will  not  be  able  to  anticipate  every  case,  and  as 
the  program  becomes  more  and  more  patched.  It  will  become  harder 
to  change  and  less  likely  to  work. 

What  we  need  Is  a  flexible  way  of  defining  words,  so  that 
the  word  "right"  Itself  can  cause  the  right  things  to  happen  In 
semantic  Interpretation,  without  changing  the  system.  This  Is 
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achieved  by  letting  the  definition  of  each  word  be  a  LISP 
program  to  be  run  at  an  appropriate  time  In  the  semantic 
analysis.  For  simple  cases,  there  are  standard  functions  with  a 
special  format  for  usual  types  of  definitions.  In  the  complex 
cases  there  Is  a  platform  from  which  to  operate,  doing  whatever 
calculations  and  changes  of  the  environment  are  needed.  This 
flexibility  Is  Important  In  many  places.  For  example,  the  wore 
"one"  v/hen  used  as  a  noun  (as  In  "the  green  one")  has  a  special 
use  for  referring  back  to  previously  mentioned  nouns.  It  could 
not  be  defined  by  a  simple  format,  as  could  "block"  or  "dog", 
since  It  Involves  complex  decisions  about  what  Is  really  being 
referred  to,  and  needs  access  to  the  previous  discourse.  In  our 
system.  Its  definition  as  a  noun  Is  compatible  with  the 
definitions  of  all  other  nouns  --  the  semantic  specialists  don't 
know  anything  about  It.  When  the  NG  specialist  Is  ready  to  use 
the  definition  of  the  noun.  It  calls  It  as  a  program.  In  the 
usual  case,  this  program  sets  up  a  standard  data  structure.  In 
the  case  of  "one".  It  calls  a  heuristic  program  for 
understanding  back-references,  and  Its  effect  on  the  meaning 
will  depend  on  the  discourse.  Similarly,  the  verb  "be"  Is 
called  like  any  other  verb  by  the  semantic  specialist,  but  In 
fact  Its  definition  Is  a  complex  program  describing  Its 
different  uses. 

The  use  of  procedures  to  represent  meanings  of  words  gives 
a  flexibility  which  allows  these  exceptional  words  to  be  handled 
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as  well  as  the  more  ordinary  forms.  At  the  same  time.  It 
provides  a  strict  test  of  representations  of  procedures  for 
particular  words,  since  tne  procedures  can  actually  be  run  In 
the  system. 
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4.1.4  Ambiguity 

A  semantic  theory  must  have  some  way  to  account  for 
multiple  meanings  of  words#  phrases,  and  sentences.  We  would 
like  to  explain  not  only  how  multiple  Interpretations  can  occur, 
but  also  how  the  hearer  sorts  them  out  to  pick  a  single  meaning. 

As  a  start,  we  must  allow  words  to  have  several  “senses", 
and  must  be  able  to  have  multiple  Interpretations  of  phrases  and 
sentences  to  correspond  to  them.  Next  we  must  realize  that  the 
syntactic  structures  can  also  lead  to  semantic  ambiguities. 
Sentences  like  the  famous  "Time  files  like  an  arrow."  derive 
some  of  their  ambiguity  from  their  ability  to  be  analyzed 
syntactically  In  more  than  one  way.  Finally,  we  Include  some 
ambiguities  as  a  result  of  the  semantic  analysis.  The  sentence 
"A  man  sitting  In  this  room  fired  the  fatal  shot."  will  be 
ambiguous  even  If  we  agree  on  a  single  meaning  for  each  word, 
and  a  surface  structure  for  the  sentence.  If  spoken  by  Perry 
Mocon  at  a  dramatic  moment  In  the  courtroom.  It  means  "a  man  who 
I s  sitting  In  this  room",  but  If  spoken  by  the  detectives  when 
they  broke  Into  the  empty  hotel  room  across  the  street  from  the 
scene  of  the  crime.  It  means  "who  was  sitting  In  this  room". 

This  could  be  treated  as  a  syntactic  ambiguity  In  the  deep 
structure,  but  In  our  analysis  It  Is  Instead  treated  as  a 
semantic  ambiguity  Involving  the  time  reference. 

In  describing  the  grammar  It  was  pointed  out  that  we  do  not 
carry  forward  simultaneous  parsings  of  a  sentence.  We  try  to 
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find  the  "best"  parsing^  and  try  other  paths  only  If  we  run  Into 
trouble.  In  semantics  we  take  the  other  aproach.  If  a  word  has 
two  meanings,  then  two  semantic  descriptions  are  built 
simultaneously,  and  used  to  form  two  separate  phrase 
Interpretations. 

We  can  Immediately  see  a  problem  here.  There  is  dire 
danger  of  a  combinatorial  explosion.  If  words  A#  B,  C,  and  D 
each  have  three  meanings,  then  a  sentence  containing  all  of  them 
may  have  3x3x3x3,  or  81  Interpretations.  The  possibilities  for 
a  long  sentence  are  astronomical. 

Of  course  a  person  does  not  build  up  such  a  tremendous 
list.  As  he  hears  a  sentence,  he  "filters  out"  all  but  the 
most  reasonable  Interpretations.  We  know  that  a  "ball"  can  be 
either  a  spherical  toy  or  a  dancing  party,  and  that  "green"  can 
mean  either  the  color  green,  or  unripe,  or  Inexperienced.  But 
when  we  see  "the  green  ball",  we  do  not  get  befuddled  with  six 
Interpretations,  we  know  that  only  one  m3’;cs  sense.  The  use  of 
"green"  for  "unripe"  applies  only  to  fruit,  the  use  as 
"Inexperienced"  applies  only  to  people,  and  the  color  only  to 
physical  objects.  The  meaning  of  "ball"  as  a  party  fits  none  of 
these  categories,  and  the  meaning  as  a  "spherical  toy"  fits  only 
the  last  one.  We  can  subdivide  the  world  Into  rough  c>asses 
such  as  "animate",  "Inanimate",  "physical",  "abstract",  "event", 
"human",  etc.  and  can  use  this  classification  scheme  to  filter 
out  meaningless  combinations  of  Interpretations. 
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Some  semantic  theories  <Fodor>  are  based  almost  completely 
on  this  Idea.  We  would  like  to  use  It  for  what  It  Is  --  not  a 
complete  representation  of  meaning,  but  a  rough  classification 
which  eliminates  fruitless  semantic  Interpretations.  Our  system 
has  the  ability  to  use  these  "semantic  markers"  to  cut  down  the 
number  of  semantic  Interpretations  of  any  phrase  or  sentence. 

A  second  method  used  to  reduce  the  number  of  different 
semantic  Interpretations  Is  to  do  the  Interpretation 
continuously.  We  do  not  pile  up  all  possible  Interpretations 
of  each  piece  of  the  sentence,  then  try  to  make  logical  sense  of 
them  together  at  the  end.  As  each  phrase  Is  completed.  It  Is 
understood.  If  we  come  across  a  phrase  like  "the  colorful  ball" 
In  contex'.,  we  do  not  keep  the  two  different  possible 
Interpretations  In  mind  until  the  utterance  Is  finished.  We 
Immediately  look  In  our  memory  to  see  which  Interpretation  Is 
meaningful  In  the  current  context  of  discourse,  and  use  only 
that  meaning  In  the  larger  semantic  analysis  of  the  sentence. 
Since  our  system  allows  the  grammar,  semantics  and  deduction  to 
be  easily  Intermixed,  It  Is  possible  to  do  this  kind  of 
continuous  Interpretation. 

Finally  we  must  deal  with  cases  where  we  cannot  eliminate 
all  but  one  meaning  as  "senseless".  There  will  be  sentences 
where  more  than  one  meaning  makes  sense,  and  there  must  be  some 
way  to  choose  tne  correct  one  In  a  given  context.  In  the 
section  on  context  below,  we  discuss  the  use  of  the  overall 
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discourse  context  In  asslginlng  a  plausibility  factor  to  a 
particular  Interpretation.  By  combining  the  plausibilities  of 
the  various  parts  of  a  sentence,  we  can  derive  an  overall  factor 
to  help  choose  the  best. 

There  will  always  be  cases  where  no  set  of  heuristics  will 
be  enough.  There  will  be  multiple  Interpretations  whose 
plausibilities  will  be  so  close  that  It  would  be  simply  guessing 
to  choose  one.  In  our  sample  dialogue,  there  Is  an  example  with 
the  word  "on".  "The  block  Is  on  top  of  the  pyramid."  could  mean 
either  "directly  on  the  surface"  or  "somewhere  above".  There  Is 
no  way  for  the  hearer  (or  computer)  to  read  minds.  The  obvious 
alternative  Ir  to  ask  the  speaker  to  explain  more  clearly  what 
Is  meant.  As  a  final  resort,  the  system  can  ask  questions  like 
"By  the  word  "on"  In  the  phrase  "on  top  of  green  blocks"  did  you 
mean  'directly  on  the  surface'  or  'somewhere  above'?".  The 
methods  used  for  handling  ambiguity  are  described  In  mere  detail 
In  section  4.2.10 


mrm 
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4.1.5  D t scourse 

At  the  beginning  of  our  discussion  of  semantics#  we 
discussed  why  a  semantic  system  should  deal  with  the  effect  of 
"setting"  on  the  meaning  of  a  sentence.  A  semantic  theory  can 
account  for  three  different  types  of  context. 

First#  there  Is  the  local  discourse  context#  which  covers 
the  discourse  Immediately  preceding  the  sentence#  and  Is 
Important  to  semantic  mechanisms  like  pronoun  reference.  If  we 
ask  the  question  "Did  you  put  Jjl  on  a  green  one?"  or  "Why?"  or 
"How  many  of  them  were  there  then?",  we  assume  that  It  will  be 
possible  to  fill  In  the  missing  Information  from  the  Immediate 
discourse.  There  are  a  number  of  special  mechanisms  for  using 
this  kind  of  Information#  and  they  form  part  of  a  semantic 
theory. 

Second#  there  Is  an  overal 1  d I scourse  context.  A  hearer 
will  Interpret  the  sentence  "The  group  didn't  have  an  Identity." 
differently  depending  on  whether  he  Is  discussing  mathematics  or 
sociology.  There  must  be  a  systematic  way  to  account  for  this 
effect  of  general  subject  matter  on  understanding.  in  addition 
to  the  effects  of  general  subject  on  choosing  betv?een  meanings 
of  a  word#  there  Is  an  effect  of  the  context  of  particular 
things  being  discussed.  If  we  are  talking  about  Argentina#  and 
say  "The  government  Is  corrupt."#  then  It  Is  clear  that  we  mean 
"the  government  of  Argentina".  If  we  say  "Pick  up  the 
pyramid."#  and  there  are  three  pyramids  on  the  table#  It  will 
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not  be  clear  which  one  Is  meant.  But  If  this  Imncrdlately 
follows  the  statement  "There  Is  a  block  and  a  pyramid  In  the 
box.",  then  the  reference  Is  to  the  pyramid  In  the  jox.  This 
would  have  been  clear  even  If  there  had  been  several  sentences 
between  these  two.  Therefore  this  Is  a  different  problem  than 
the  local  discourse  of  pronoun  reference.  A  semantic  theory 
must  deal  with  all  of  these  different  forms  of  overall  discourse 
context . 

Finally,  there  is  a  context  of  knowledge  about  the  world, 
and  the  way  that  knowledge  effects  our  understanding  of 
language.  If  we  say  "The  city  coui'c Ilmen  refused  the 
demonstrators  a  permit  because  they  feared  violence.",  the 
pronoun  "they"  will  have  a  different  Interpretation  than  If  we 
i:;?d  "The  city  councllmen  refused  the  demonstrators  a  permit 
because  they  advocated  revolution."  We  understand  this  because 
of  our  sophisticated  knowledge  of  councllmen,  demonstrators,  and 
politics  —  no  set  of  syntactic  or  semantic  rules  could 
Interpret  this  pronoun  reference  without  using  knowledge  of  the 
world.  Of  course  a  semantic  theory  does  not  Include  a  theory 
of  political  power  groups,  but  It  must  explain  the  ways  In  which 
this  kind  of  knowledge  can  Interact  with  linguistic  knowledge  In 
Interpreting  a  sentence. 

Knowledge  of  the  world  may  affect  not  only  such  things  as 
the  Interpretation  of  pronouns,  but  may  alter  the  parsing  of  the 
syntactic  structures  as  well.  If  we  see  the  sentence  "He  hit 
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the  car  with  a  rock."  the  structure  will  be  parsed  differently 
from  "He  hit  the  car  with  a  dented  fender."/  since  we  know  that 
cars  have  fenderS/  but  not  rocks. 

In  our  system#  most  of  this  discourse  knowledge  Is  called 
on  by  the  semantic  specialists#  and  by  particular  words  such  as 
"one"#  "It"#  "then"#  "there"#  etc.  We  have  concentrated 
particularly  on  local  discourse  context#  and  the  ways  In  which 
English  carries  Information  from  one  sentence  to  the  next.  A 
number  of  special  pieces  of  Information  are  kept#  such  as  the 
time#  place#  and  objects  mentioned  In  the  previous  sentence. 
This  Information  Is  referenced  by  special  structures  and  words 
like  pronouns#  "then"#  and  "there".  The  meaning  of  the  entire 
previous  sentence  can  be  referred  to  In  order  to  arswer  a 
question  like  "Why  did  you  do  that?"  or  just  "Why?". 

There  are  two  facilities  for  handling  overall  discourse 
context.  The  first  Is  a  mechanism  for  assigning  a 
"pi ausab 1 1 1 ty  factor"  to  an  Interpretation  of  a  word.  For 
example#  the  definition  of  the  word  "bank"  might  Include  the 
fact  that  If  we  are  discussing  money#  it  Is  most  likely  to  mean 
a  financial  Institution#  while  If  we  are  discussing  river'  It 
probably  means  the  edge  of  the  land.  Our  system  allows  the 
definition  of  a  word  to  Include  a  program  to  compute  a 
"plausabl 1 1 ty  factor"  (an  arbitrary  additive  constant)  for  each 
Interpretation.  This  computation  might  Involve  looking  at  the 
rest  of  the  sentence  for  key  words#  or  might  use  some  more 
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general  Idea,  like  keeping  track  of  the  general  area  of 
discussion  (perhaps  In  some  sort  of  network  or  block  structure) 
and  letting  the  plausibility  of  a  particular  meaning  depend  on 
•ts  "distance"  from  the  current  topic.  This  has  not  been 
implemented  since  we  have  Included  only  a  single  topic  of 
discourse  In  the  vocabulary.  It  Is  discussed  further  In  section 
5.2. 

The  second  type  of  overall  discourse  context  Involves  the 
objects  which  have  been  previously  mentioned.  Whenever  an 
object  or  one  of  its  properties  Is  mentioned,  either  by  the 
human  or  the  computer,  a  note  Is  made  of  the  time.  Later,  If  we 
use  a  phrase  like  "the  pyramid",  and  the  meaning  Is  not  clear, 
the  system  can  look  for  the  one  most  recently  mentioned. 

Finally,  the  knowledge  of  the  world  can  enter  Into  the 
semantic  Interpretation.  We  have  mentioned  that  the  grammar  can 
ask  the  semantic  Interpreter  "Does  this  NOUN  GROUP  make  sense?" 
before  continuing  the  parsing.  The  semantics  programs  can  In 
turn  call  on  PLANNER  to  make  any  deductions  needed  to  decide  on 
its  sensibility.  Thus  Information  about  the  world  can  guide  the 
parsing  dl rectly. 
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4>1.6  GcaU  ^  ^  Somant  I  c  Uisgi:^ 

We  have  set  ourselves  very  broad  goals  In  our  definition  of 
seTinnt  I cs/  asking  for  everything  which  needs  to  be  done,  rather 
than  llnltlng  ourselvc*:  to  those  aspects  which  can  be  explained 
and  characterized  In  a  neat  formalism.  How  does  this  compare 
with  the  more  limited  goals  of  a  semantic  theory  like  that  of 
Fodor  and  Katz  <Fodor>,  which  looks  only  at  those  aspects  of 
meaning  which  are  Independent  of  the  "setting"  of  a  sentence? 

We  have  seen  tnat  their  theory  of  "semantic  markers"  Is  In 
fact  a  part  of  the  "filtering"  needed  for  "exploiting  semantic 
relations  In  the  sentence  to  eliminate  potential  ambiguities" 
(<Fodor>p.  485)",  and  that  the  "semantic  d I s t I ngul shers"  are  a 
rudimentary  form  of  the  logical  descriptions  which  we  build  up 
to  describe  objects  and  events.  They  state  that  "the 
distinction  between  markers  an'  d I st I ngulshers  Is  meant  to 
coincide  with  the  distinction  between  that  part  of  the  meaning 
of  a  lexical  I  tern  which  Is  systematic  for  the  language  and  that 
part  of  the  meaning  of  the  I  tern  which  Is  not."  (<Fodor>  p.  498). 
We  believe  that  much  more  of  meaning  Is  systematic,  and  that  a 
semantic  theory  can  be  of  a  much  wider  scope. 

What  about  the  more  restricted  goals  a  semantic  theory 
might  achieve  such  as  "accounting  for....  the  number  and  content 
of  the  readings  of  a  sentence,  detecting  semantic  anomalies,  and 
deciding  upon  paraphrase  relations  between  sentences."?  In  a 
more  complete  semantic  theory,  these  are  not  primary  goals,  but 
by-products  of  the  analysis.  A  phrase  Is  a  semantic  anomaly  If  • 
the  system  produces  no  possible  interpretations  for  It.  Two 
sentences  are  paraphrases  If  they  produce  the  same 
representation  In  the  Internal  formalism  for  meaning,  and  the 
"number  and  content"  of  the  readings  of  a  sentence  are  the 
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Immediate  result  of  Its  semantic  analysis.  Which  of  these  will 
happen  depends  on  the  entire  range  of  ways  In  which  language 
communicates  meaning/  not  on  a  restricted  subset  such  as  the 
logical  relations  of  markers.  Once  we  have  a  conceptual 
representation  for  meaning/  problems  such  as  these  are  secondary 
byproducts  of  the  basic  analysis  which  relates  a  sentence  to  the 
representation  of  Its  meaning. 

In  addition/  we  can  talk  about  sentences  being  anomalies  or 
paraphrases  "In  context"/  as  well  as  "without  regard  to 
context"/  since  we  want  the  theory  to  Include  a  systematic 
analysis  of  those  features  of  context  which  are  relevant  to 
understand  1 ng. 
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4,2  Semantic  Structures 

The  previous  section  outlined  the  structure  of  a  semantic 
Interpreter,^  and  described  the  use  of  semantic  "specialists"  In 
analyzing  different  aspects  of  linguistic  structure.  Each 
specialist  is  In  the  form  of  a  procedure  v/hich  describes  the 
particular  aspect  of  semantics  which  with  it  Is  concerned.  We 
can  look  at  Its  function  as  creating  a  part  of  a  complete 
description  of  the  meaning  of  the  sentence  by  building  complex 
list  structures  which  we  will  call  "semantic  structures"  to 
describe  objects  and  relationships.  Events  are  a  type  of 
relationship  (Involving  time),  and  the  class  of  "object" 

Includes  anything  which  could  be  treated  as  an  object  In  English 
grammar,  even  If  It  Is  as  abstract  as  "tfuth".  There  are  two 
basic  types  of  structures  used  —  one  to  describe  objects,  (an 
Object  Semantic  Structure,  or  OSS)  and  the  other  to  describe 
relationships  (an  RSS).  In  general,  noun  groups  are  Interpreted 
to  form  object  structures,  while  the  other  groups  and  clauses 
are  Interpreted  to  form  relationship  structures.  Words  already 
have  a  semantic  structure  of  their  own  (their  definition)  and 
are  used  In  building  up  the  structures  for  the  larger  units 
which  contain  them. 
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4.2.1  Object  Semantic  ,S£co.c.t;u.r.e.s 

Let  us  first  look  at  the  semantic  structures  used  to 


describe  objects.  First,  we  need  the  actual  PLANNER  statements 
which  will  be  used  In  deducing  things  about  the  objects.  An  NG 
like  "a  red  cube"  can  be  described  using  the  formalism  of 
Chapter  3: 


(THPROG  (XI) 

(THG0AL(#IS  S?X1  #BL0CK)) 
(#EQDIM  $?X1) 

(THG0AL(#C0L0R  $?X1  #RED))) 
Figure  57  --  Simple  PLANNER  Description 


The  variable  "Xl”  represents  the  object,  and  this 
description  says  that  it  should  be  a  block,  it  should  have  equal 
dimensions,  anc  It  should  be  red.  (See  section  3.4  for  the 
details  of  representation).  A  phrase  such  as  "a  red  cube  which 
supports  three  pyramids  but  Is  not  contained  in  a  box"  has  a 
more  complex  dercriptlon.  This  would  be  built  up  from  the 
descriptions  for  the  various  objects,  and  would  end  up; 


(THPROG(Xl) 

(THG0AL(#IS  $?X1  fBLOCK)) 

(#EQDIM  $?X1) 

(THG0AL(#C0L0R  $?X1  #RED)) 

(THFIND  3  $?X2  (X2)  (THG0AL(#IS  $?X2  #PYRAMID)) 

(THG0AL(#SUPP0RT  $?X1  S?X2))) 

(THN0T(THPR0G(X3) 

(TH60AL(#IS  $?X3  #B0X)) 

(THG0AL(#C0NTAIN  $?X3  $?X1))))) 

Figure  58  --  PLANNER  Description 


We  can  learn  how  the  semantic  specialists  work  by  watching 
them  build  the  pieces  of  this  structure.  First  take  the  simpler 
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NG,  "a  red  cube".  The  first  NG  specialist  doesn't  start  work 
until  after  the  noun  has  been  parsed.  The  PLANNER  description 
Is  then  built  backwards,  starting  with  the  noun,  and  continuing 
In  r I gh t-to-l ef t  order  through  the  classifiers  and  adjectives. 
The  beginning  of  the  NG,  with  the  determiner,  number,  and 
ordinal  Is  handled  by  a  part  of  the  NG  specialist  described 
later.  The  first  NG  specialist  Is  named  SMNGl  --  all  of  the 
names  begin  with  SM  (for  "semantic"),  followed  by  the  name  of, 
the  unit  they  work  with,  followed  by  a  number  Indicating  the 
order  In  which  they  are  called.  SMNGl  sets  up  an  environment 
(we  will  describe  various  parts  of  t t  as  we  go),  then  calls  the 
definition  of  the  noun.  (Remember  that  definitions  are  In  the 
form  of  programs).  For  simple  nouns  there  Is  a  standard 
function  to  define  them  easily.  What  should  the  definition 
Include?  First,  a  way  to  Indicate  the  PLANNER  statements  which 
are  the  heart  of  Its  meaning.  The  symbol  "***"  Is  used  to 
represent  the  object,  so  our  definition  of  "cube"  contains  the 
expression: 

((#IS  ***  #BL0CK)(#EaDIM  **♦)) 

The  syntax  of  PLANNER  functions  such  as  THPROG  and  THGOAL  will 
be  added  by  the  specialists,  since  we  v/ant  to  keep  the 
definition  as  simple  as  possible. 

There  Is  one  other  part  of  the  definition  for  a  noun  --  the 
semantic  markers,  used  to  filter  out  meaningless 
I ntorpretat Ions  of  a  phrase.  The  definition  needs  to  attach 
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these  semantic  markers  to  each  OSS.  The  BLOCKS  world  uses  the 
tree  of  semantic  markers  In  Figure  59. 

This  Is  the  same  type  of  diagram  used  for  grammars.  In 
which  vertical  bars  represent  choices  of  mutually  exclusive 
markers,  while  horizontal  lines  represent  logical  dependency. 

The  symbol  "#PHYS0B"  means  "physical  object",  and  "#MANIP"  means 
"manipulable  object".  The  word  "cube"  refers  to  an  object  vHth 
the  markers  (#THING  #PhYS0B  #MANIP  #BL0CK).  We  shouldn't  need 
to  mention  all  of  these  In  the  definition,  since  the  presence  of 
#BL0CK  implies  the  others  through  the  logical  structure  of  the 
marker  tree. 

The  definition  of  the  noun  "cube"  Is  then: 

(NMEANS((#BLOCK)((#IS  ***  #BL0CK) ( #EQD IM  ***)))) 

NMEANS  Is  the  name  of  the  function  for  dealing  with  nouns, 
and  It  accepts  a  list  of  different  meanings  for  a  word.  In  this 
case,  there  Is  only  one  meaning.  The  first  part  of  the 
definition  Is  the  marker  list,  followed  by  the  reduced  PLANNER 
definition.  V^hen  NMEANS  Is  executed.  It  puts  this  Information 
onto  the  semantic  structure  which  Is  being  built  for  the  object. 
It  takes  care  of  finding  out  what  markers  are  Implied  by  the 
tree,  and  deciding  which  predicates  need  to  be  In  a  THGOAL 
statement  (like  #IS),  and  which  are  LISP  predicates  (like 
*EQ0IM).  We  will  see  later  how  It  also  can  decide  what 
recommendation  lists  to  put  onto  the  PLANNER  goals,  to  guide  the 
deduction. 
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THING# 


#NAME 

#PLACE  #SHAPE 

#PROPERTY — 

#SIZE 

#L0CATI0N 

#COLOR 

#AN IMATE - 


/^SPECTRUM 


#BLUE 

#RED 

#BLACK 

#WHITE 

#GREEN 


#R0B0T 

#HUMAN 


#physobH 


I 


#STACK 


#COMSTRUCT— H#PILE 


#RELATION 


H 


#HAND 
#TA8LE 
#MAN  I  P 
#B0X 
#EVENT 
#T I ME LESS 


I 


#R0W 


#PYRAMID 

#BLOCK 

#BALL 


Figure  59 


Semantic  Markers  for  the  BLOCKS  Vocabulary 
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SMNGl  then  calls  the  definition  for  the  adjective  "red". 

We  would  like  this  definition  to  include  the  PLANNER  assertion 
(#COLOR  ***  #RED),  and  Indicate  that  It  applies  only  to  physical 
objects.  We  can  use  the  same  format  used  for  nouns#  defining 
"red"  as: 

(NMEANS((#PHYSOB) ((#C0L0R  ♦**  #RED)>)) 

Notice  that  there  Is  no  distinction  made  betv;een  the  use  of 
#PHYSOB  here  to  Imply  "applies  only  to  physical  objects"  and  the 
use  of  #BL0CK  In  the  definition  of  "cube"  to  say  "this  Is  a 
block".  This  Is  because  of  the  way  the  markers  are 
Implemented.  The  marker  list  In  a  definition  is  Interpreted  to 
mean  "this  definition  applies  only  If  none  of  the  markers  here 
are  In  conflict  with  any  of  the  markers  already  established  for 
the  object".  Since  the  noun  Is  the  first  thing  Interpreted#  Its 
markers  cannot  possibly  conflict#  and  are  simply  entered  as  the 
Initial  marker  list  for  the  object.  The  marker  programs  are 
designed  so  that  we  do  not  need  to  limit  ourselves  to  a  single 
tree  --  we  could  classify  objects  along  several  dimensions#  and 
set  up  separate  marker  trees  for  each.  For  example#  we  might 
classify  objects  both  by  their  physical  properties  and  by  their 
use. 

The  order  of  analysis  of  modifiers  la  quite  natural  to  the 
use  of  "relative"  modifiers.  It  Is  Impossible  to  give  an 
absolute  definition  for  "big"  or  "little"#  since  a  "big  flea"  Is 
still  not  much  competition  for  a  "little  elephant". 


The  meaning 
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of  the  adjective  Is  relative  to  the  noun  It  modifies.  In  fact/ 

It  may  also  be  relative  to  the  adjectives  following  It  as  well. 

A  "big  toy  elephant"  Is  on  a  scale  of  Its  own.  Since  our  system 
analyzes  the  MR  from  right  to  left,  the  meaning  of  each 
adjective  Is  added  to  the  description  already  built  up  for  the 
head  and  modifiers  to  the  right.  Since  each  definition  Is  a 
program.  It  can  just  as  well  be  a  program  which  examines  the 
description  (both  the  semantic  markers  and  the  PLANNER 
description)/  and  produces  an  appropriate  meaning  relative  to 
the  object  being  described.  This  might  be  In  the  form  of  an 
absolute  measurement  (e.g.  a  "big  elephant"  Is  more  than  12  feet 
tall)  or  can  remain  In  a  relative  form  by  producing  a  PLANNER 
expression  of  the  form  "the  number  of  objects  fitting  the 
description  and  smaller  than  the  one  being  described  Is  more 
than  the  number  of  suitable  objects  bigger  than  It  Is". 

In  adding  the  meaning  of  "red"  to  the  semantic  structure, 
the  specialist  must  make  a  choice  In  ordering  the  PLANNER 
expressions.  We  remember  from  section  2.3  that  the  order  of 
expressions  can  be  Important,  since  variable  assignments  are 
done  In  the  order  encountered.  If  we  have  the  first  sequence 
shown  In  Figure  60,  PLANNER  will  look  through  all  of  the  blocks, 
checking  until  It  finds  one  which  Is  red.  However  If  we  have 
the  second.  It  will  look  through  all  of  the  red  objects  until  It 
finds  one  which  Is  a  block.  In  the  robot's  tiny  world,  this 
Isn't  of  much  Importance,  but  If  we  had  a  data  base  which  could 


r‘ 
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take  phrases  like  "a  man  In  this  room'*/  we  would  certainly  be 
better  off  looking  around  the  room  first  to  see  what  was  a  man, 
than  looking  through  all  the  men  In  the  world  to  see  If  one  was 
In  the  room. 

(THPROG(X) 

(THGOAL(#IS  $?X  #BL0CK)) 

(THG0AL(#C0L0R  $ ?X  #RED))) 


(THPROG(X) 

(THGOAL(#COLOR  $?X  #RED)) 

(THGOAL(#IS  $?X  #BL0CK))) 

Figure  60  —  Ordering  Goals 

To  make  this  choice  we  allow  each  predicate  (like  #IS  or 
#C0L0R)  to  have  associated  with  It  a  program  which  knows  how  to 
evaluate  Its  "priority"  In  any  given  environment.  The  program 
might  be  as  simple  as  a  single  number/  which  would  mean  "this 
relation  always  has  this  priority".  It  might  on  the  other  hand 
be  a  complex  heuristic  program  which  takes  Into  account  the 
current  state  of  the  world  and  the  discussion.  In  our 
definitions,  we  have  adopted  the  simpler  alternative,  assigning 
fixed  priorities  In  the  tenge  0  to  1000  arbitrarily.  By  keeping 
track  of  the  priority  of  the  expression  currently  at  the  top  of 
the  PLANNER  description,  the  function  NMEANS  can  decide  whether 
to  add  a  new  expression  above  or  below  It. 

Let  us  now  look  at  the  actual  structure  which  would  be 
built  up  by  the  program; 


I 
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(  (  ((XI)  200  (THG0AL(#IS  $?X1  #BL0CK)) 

(THG0AL(#C0L0R  $?X1  #RED)) 
(#EQDtM  $?X1)) 

PLANNER 
dt^scrl  ptlon 

(0  #BL0CK  #MANtP  #PHYS0B  #THING) 

merkers 

(#MAN!P  #PHYS0B  #TH1NG) 

systems 

XI 

variable 

(NS  INDEF  NIL) 

determiner 

NIL) 

ordinal 

Figure  61  —  OSS  for  "a  red 

cube” 

Most  of  the  parts  of  this  structure  (called  an  Obj ect 
Semantic  Structure  or  OSS)  have  already  been  explained.  The 
PLANNER  description  Includes  a  variable  list  (we  will  see  Its 
use  later),  the  priority  of  the  first  expression,  and  a  list  of 
PLANNER  expressions  describing  the  object.  The  '•markers” 
position  lists  all  of  the  semantic  markers  applicable  to  the 
object.  The  0  at  the  beginning  of  the  list  Is  the 
”p? ausab 1 11 ty”  of  this  Interpretation.  This  factor  was 
discussed  In  section  4.1.4,  and  l^  set  when  we  are  faced  with 
more  than  one  possible  Interpretation  of  a  word.  Each  semantic 
structure  carries  along  with  It  an  accumulated  plausibility 
rating.  This  will  remain  0  unless  It  Is  set  specifically  by  an 
ambiguity. 

The  "systems”  position  Is  a  list  of  all  of  the  nodes  In 
the  set  of  marker  trees  (remember  that  there  can  be  more  than 
one)  which  have  already  had  a  branch  selected.  It  Is  used  In 
looking  for  marker  conflicts.  The  "variable”  Is  the  variable 
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name  chosen  to  represent  this  object.  The  system  generates  it 
from  the  set  XI,  XI,  X3...,  providing  a  new  one  for  each  new 
structure.  The  only  two  positions  left  are  the  determiner  and 


the  ordinal 


These  are  explained  In  section  4.2.4 
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Let  us  now  take  a  slightly  more  complicated  NG,  "a  red  cube 
which  supports  a  pyramid,”  and  follow  the  parsing  and  semantic 
analysis.  First,  the  MG  parsing  program  finds  the  determiner 
("a”),  adjective  ("red”),  and  noun  ("cube”).  At  this  point 
SMNGl  Is  called  and  creates  the  structure  described  In  the 
previous  section.  Notice  that  the  MG  Is  not  finished  when  SMNGl 
Is  called  --  It  has  only  reached  a  point  where  we  can  do  a  first 
analysis.  At  this  point,  the  NG  might  be  rejected  without 
further  parsing  If  the  combination  of  noun,  classifiers,  and 
adjectives  Is  contradictory  to  the  system  of  semantic  markers. 

Next  the  NG  program  looks  for  a  qualifier,  and  calls  the 
CLAUSE  part  of  the  grammar  by  (PARSE  CLAUSE  RSQ) .  The  feature 
RSQ  (rank  shifted  qualifier)  Informs  the  CLAUSE  program  that  It 
should  look  for  a  RELWD  like  "which".  It  does,  and  then  looks 
for  a  VG,  succeeding  with  "supports".  The  VG  program  calls  Its 
own  semantic  specialist  to  analyze  the  time  reference  of  the 
clause,  but  we  will  Ignore  this  for  now.  Next,  since  "support" 
Is  transitive,  the  CLAUSE  looks  for  an  object,  and  calls  the  NG 
program.  This  operates  In  the  same  way  as  before,  producing  a 
semantic  structure  to  describe  "a  pyramid".  The  definition  of 
"pyramid"  Is: 

(NMEANS((#PYRAMID)((#IS  ***  #PYRAMID)))) 
so  the  resulting  structure  Is: 
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(  ((X2)  200  (THG0AL(*IS  S?X2  ^PYRAMID))) 

(0  fPYRAMID  #MAN1P  #PHYS0B  #THING) 
{#MANIP  #PHYS0R  #THING)) 

X2 

(NS  INDEF  NIL) 

NIL) 

Figure  62  --  OSS  for  "a  pyramid" 


At  this  point  the  first  CLAUSE  specialist  Is  called  to 
analyze  the  clause  "which  supports  a  pyramid".  We  want  to 
define  verbs  In  a  simple  way^  as  we  do  nouns  and  adjectives/ 
saying  something  like  "If  the  subject  and  object  are  both 
physical  objects/  then  "support"  means  the  relation  #SUPP0RT 
between  them  In  that  order".  This  Is  written  formally  using  the 
function  CMEANS/  as: 

(CMEANS((((#PHYSOB))((#PHYSOB)))(#SUPPORT  #1  #2)NIL)) 

All  of  the  extra  parentheses  are  there  to  leave  room  for 
fancier  options  which  will  be  described  later.  The  Important 
parts  are  the  semantic  marker  lists  for  the  objects 
participating  In  the  relationship/  and  the  actual  PLANNER 
expression  naming  It.  The  symbols  "#1"  and  "#2"  (and  "#3"  If 
necessary)  are  used  to  Indicate  the  objects/  and  the  normal 
order  Is  1.  semantic  subject  (SMSUB)  2.  semantic  first  object 
(SMOBl)  3.  semantic  second  object  (SM0B2).  Notice  that  we 
have  prefixed  the  word  "semantic"  to  each  of  these.  In  fact/ 
they  may  very  well  not  be  the  actual  syntactic  subject  and 
objects  of  the  clause.  In  this  example/  the  SMSUB  Is  the  NG  "a 
red  cube"  to  which  the  clause  Is  being  related.  SMCLl  knows 


-4 
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this  since  the  parser  has  noted  the  feature  SU3JREL.  Before 
calling  the  definition  of  the  verb^  SMCLl  has  found  the  OSS 
describing  "a  red  cube”  and  set  It  as  the  value  of  the  variable 
SMSUB.  Similarly  It  has  taken  the  OSS  for  ”a  pyramid”  and  put 
It  In  SMOBl,  since  it  Is  the  object  of  the  clause.  The 
definition  of  the  verb  "support”  Is  now  called,  and  CMEANS  uses 
the  Information  In  the  definition  to  build  up  a  Relation 
Semantic  Structure  (RSS).  First  it  checks  to  make  sure  that 
both  objects  are  compatible  with  their  respective  marker  lists. 
The  marker  lists  are  In  the  same  order  as  the  symbols  fl,  #2, 
and  #3.  In  this  case,  both  the  subject  and  object  must  be 
physical  objects. 

Next  SMCLl  substitutes  the  objects  Into  the  relation.  If 
It  Inserted  the  actual  semantic  structures,  the  result  would  be 
hard  to  read  and  time-consuming  to  print.  Instead,  the  NG 
specialists  assign  a  name  to  each  OSS,  from  the  set  MGl,  NG2, 
NG3,...  We  therefore  get  (^SUPPORT  NGl  NG2)  as  the  description 
of  the  relationship.  The  final  semantic  structure  for  the 
clause  (after  a  second  specialist,  SMCL2  has  had  a  chance  to 
look  for  modifiers  and  rearrange  the  structure  Into  a  convenient 
form)  Is: 
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(NGl  (#SUPPORT  NGl  NG2)  NIL)  (0)) 

rel  relation  neg  markers 

Figure  63  --  Relation  Semantic  Structure  1 
The  position  marked  "rel"  holds  the  name  of  the  NG 
description  to  which  this  clause  serves  as  a  modifier.  We  will 
see  later  that  It  can  be  used  In  a  more  general  way  as  well. 

The  "relation"  Is  the  material  for  PLANNER  to  use#  and  "neg" 
marks  whether  the  clause  Is  negative  or  not. 

The  last  element  Is  a  set  of  semantic  markers  and  a 
priority#  just  as  we  had  with  object  descriptions. 

Relationships  have  the  full  capability  to  use  semantic  markers 
just  as  objects  do#  and  at  an  early  stage  of  building  a  relation 
structure.  It  contains  a  PLANNER  description#  markers#  and 
systems  In  the  Identical  form  to  those  for  object  structures 
(this  Is  to  share  some  of  the  programs#  such  as  those  which 
check  for  conflicts  between  markers).  We  can  classify  different 
types  of  events  and  relationships  (for  example  those  which  are 
changeable#  those  which  Involve  physical  motion#  etc.)  and  use 
the  markers  to  help  filter  out  Interpretations  of  clause 
modifiers.  For  example#  the  modifying  PREP6  "without  the 
shopping  list"  In  "He  left  the  house  without  the  shopping  list" 
has  a  different  Interpretation  from  "without  a  hammer"  In  "He 
built  the  house  without  a  hammer."  If  we  had  a  classification  of 
activities  which  Included  those  involving  motion  and  those  using 
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tools#  we  could  choose  the  correct  Interpretation.  A  system  can 
^  be  constructed  which  operates  much  like  Fillmore's  ccse  system# 

assigning  classes  of  verbs  according  to  the  type  of  modification 
they  take#  and  using  this  to  find  the  correct  relation  between  a 
•  verb  and  Its  modifying  phrase.  This  will  be  discussed  more  In 

the  section  or  types  of  PREPG. 

In  our  limited  viorld#  we  have  not  set  up  a  marker  tree  for 
relationships  and  events#  so  we  have  not  Included  any  markers  In 
the  definition  of  "support".  The  marker  list  In  the  RSS 
therefore  contains  only  the  plausibility#  0.  The  "NIL"  In  the 
definition  Indicates  that  there  are  no  markers#  and  would  be 
replaced  by  a  list  of  markers  If  they  were  used. 

The  clause  Is  nov/  finished#  and  the  specialist  on  relative 
clauses  (SHRSO.)  Is  called.  Its  task  Is  to  take  the  Information 
contained  In  the  PLANNER  descriptions  of  the  objects  Involved  In 
the  relation#  along  with  the  relation  Itself#  and  to  put  It  all 
onto  the  PLANNER  description  of  the  object  to  which  the  clause 
Is  being  related.  The  way  In  which  this  Is  done  depends  on  the 
exact  form  of  the  different  objects  (particularly  on  their 
determiners).  In  this  case#  It  Is  relatively  easy#  and  the 
description  of  "a  red  cube  v;hlch  supports  a  pyramid"  becomes: 
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(  (  ((XI  X2)  200  (THG0AL(#IS  $?X1  #BL0CK)) 

(THG0AL(#C0L0R  $?X1  #RED)) 
(#EQDIM  $?X1) 

(THG0AL(#IS  $?X2  #PYRAMID)) 
(THG0AL(#SUPP0RT  $?X1  $?X2))) 


(0  #BL0CK  #MAN!P  fPHYSOB  #THING) 

(#MANIP  #PHYSOB  #TH!NG)) 

XI 

(NS  INDEF  NIL) 

NIL) 

Figure  64  —  OSS  for  "a  block  which  supports  a  pyramid” 

The  only  thing  which  has  changed  Is  the  PLANNER 
description,  which  now  holds  all  of  the  necessary  information. 
Its  -variable  list  contains  both  XI  and  X2,  and  these  variable 
names  have  been  substituted  for  the  symbols  NGl  and  NG2  In  the 
relation,  which  has  been  comolned  with  the  separate  PLANNER 
descriptions  for  the  objects.  Section  4.2.4  describes  how  a 
relative  clause  works  with  other  types  of  NG  descriptions. 


Section  4.2.3  -  Page  316 


4.2.3  Prepos 1 t Ion  Groups 

Comparing  the  phrase  "a  red  cube  which  supports  a  pyramid" 
with  the  phrase  "a  red  cube  under  a  pyramid,  we  see  that 
relative  clauses  and  qualifying  prepositional  phrases  are  very 
similar  In  structure  and  meaning.  In  fact,  their  semantic 
analysis  Is  almost  Identical.  The  definition  of  a  preposition 
like  "under"  uses  the  same  function  as  the  definition  of  a  verb 
like  "support",  saying  "If  the  semantic  subject  and  object  are 
both  physical  objects,  then  the  object  Is  #ABOVE  the  subject" 
(Pemembe»  that  In  our  BLOCKS  world  we  chose  to  represent  all 
vertical  space  relations  using  the  concept  #AB0VE).  This  can  be 
formal  I  zed  as: 

(CMEANS((((#PHYSOB)){(#PHYSOB)))(#ABOVE  #2  #1)NIL) 

Again,  the  symbols  #1  and  #2  refer  to  the  semantic  subject 
and  semantic  first  object,  but  In  the  case  of  a  preposition 
group  used  as  a  qualifier,  the  SMSU6  Is  the  MG  of  which  the 
PREPG  Is  a  part,  while  the  SMOBl  Is  the  object  of  the  PREPG  (the 
PREPOBJ).  As  with  clauses,  the  situation  may  be  more  complex. 
For  example.  In  a  sentence  like  "Vlho  was  the  antelope  I  saw  you 
with  last  night?",  the  SMOBJ  of  the  PREP  "with"  Is  the  question 
element  "who"  In  the  MAJOR  CLAUSE.  However,  the  PREPG 
specialist  (SMPREP)  takes  care  of  all  this,  and  In  defining  a 
preposition,  we  can  deal  directly  with  the  SMSUB  and  the  SMOBl. 
Notice  that  If  we  had  been  defining  "above"  Instead  of  "under", 
everything  would  have  been  the  same  except  that  the  relation 
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would  have  been  (#ABOVE  #i  #2)  Instead  of  (#ABOVE  #2  #1).  If 
the  PREPG  Is  an  adjunct  to  a  CLAUSE,  the  SMSUBd  Is  the  RSS 
defining  the  CLAUSE.  The  definition  of  a  preposition  can  then 
use  the  semantic  markers  which  are  Included  In  an  RSS. 
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4.2.4  Types  of  Object  Desert  Pt I ons 

In  the  examples  so  far,  all  of  the  objects  described  have 
been  singular  and  INDEFtnIte,  like  "j,  red  cube",  and  the 
semantic  system  has  been  able  to  assign  them  a  PLANNER  variable 
and  use  It  In  building  their  properties  Into  the  description. 

Let  us  consider  another  simple  case,  a  DEFInlte  object,  as  In  "a 
red  cube  which  supports  the  pvrami d" . 

The  analysis  begins  exactly  as  It  did  for  the  earlier  case, 
building  a  description  of  "red  cube",  then  one  of  "pyramid." 

The  "pyramid"  description  differs  from  OSS  2  In  having  DEF  In 
place  of  INDEF  In  Its  determiner.  This  Is  noted  at  the  very 
beginning  of  the  analysis,  but  has  no  effect  until  the  entire  NG 
(Including  any  qualifiers)  has  been  parsed.  At  that  time,  the 
second  NG  specialist  SMNG2  checks  for  a  definite  NG  and  tries  to 
determine  what  It  refers  to  before  going  on  (we  have  pointed  out 
In  various  places  how  this  Is  used  to  guide  the  parsing).  It 
takes  the  PLANNER  description  which,  has  been  built  up,  and  hands 
It  to  PLANNER  In  a  THFIND  ALL  expression.  The  result  Is  a  list 
of  all  objects  fitting  the  description.  Presumably  If  the 
speaker  used  "the",  he  must  be  referring  to  a  particular  object 
he  expects  the  listener  to  be  aware  of.  If  more  than  one  object 
fits  the  description,  there  are  various  discourse  heuristics 
used  to  find  the  reference,  (see  Section  4.3.3)  and  If  nothing 
succeeds,  a  failure  message  Is  produced  and  the  parser  has  to 
back  up  and  try  something  else  to  parse  the  NG. 


oi«*< 
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If  SMNG2  Is  able  to  find  the  jbject  being  referred  to^  It 

puts  It  Into  the  description  (on  the  property  list).  When  SMRSQ 

relates  the  descriptions  to  build  the  meaning  of  "a  red  cube 

which  supports  the  pyramid"  It  takes  advantage  of  this.  The 

object  found  will  have  a  proper  name  like  :B5.  Instead  of 

building  the  PLANNER  description  of  OSS  3/  It  builds: 

((XI)  200  (THG0AL(#IS  $?X1  #SL0CK)) 

(#EQDIM  $?X1) 

(THG0AL(#SUPP0RT  $7X1  :B5))) 

Figure  65  --  PLANNER  Description  2 
"a  red  cube  which  supports  the  pyramid" 

The  object  Itself  Is  used  In  the  relation  rather  than  dealing 

with  Its  description. 

What  If  we  had  asked  about  "a  red  cube  which  supports  three 
pyramids"?  In  that  case  the  PLANNER  description  would  Include 
an  expression  using  the  function  THFIND  with  a  numerical 
parameter/  as  shown  In  Figure  66.  If  we  had  said  "a  red  cube 
which  supports  at  most  two  pyramids"/  a  fancier  THFINO  parameter 
would  have  been  used/  as  shown.  Here/  the  parameter  means  "be 
sat.sfled  If  you  don't  find  any,  but  If  you  find  J,  Immediately 
cause  a  failure."  In  addition  to  numbers/  the  SMNGl  and  RSQ 
programs  can  work  together  to  relate  descriptions  of  quantified 
objects.  "A  red  cube  which  supports  some  pyramid"  Is  handled 
just  like  the  original  Indefinite  case.  "A  red  cube  which 
supports  jia  pyramid"  and  "a  red  cube  which  supports  every 
pyramid"  are  handled  using  the  other  PLANNER  primitives.  A 
universal  quantifier  Is  translated  as  "there  Is  no  pyramid  which 
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(THG0AL(#IS  $?X2  ^PYRAMID)) 

(THG0AL(#SUPP0RT  $?X1  $?X2)) 

"which  supports  a  pyramid" 

(THG0AL(#SUPP0RT  $?X1  :B3)) 

"which  supports  the  pyramid" 


(THFIND  3  $?X2  (X2)  (THG0AL(#IS  $?X2  #PYRAMID)) 

(THG0AL(#SUPP0RT  $?X1  $?X2))) 

"which  supports  three  pyramids" 


(THFIND  (0  3  NIL)  $?X2  (X2)  (THG0AL(#IS  $?X2  #PYRAMI0)) 

(THG0AL(#SUPP0RT  $?X1  $?X2))) 

"which  supports  at  most  two  pyramids" 


(THNOT 

(THPROG  (X2)  (THG0AL(#IS  $?X2  #PYRAMID)) 

(THG0AL(#SUPP0RT  $?X1  $?X2))))) 


"which  supports  no  pyramids" 


(THNOT 

(THPROG  (X2)  (THG0AL(#IS  $?X2  #PYRAMID)) 

(THNOT 

(THG0AL(#SUPP0RT  S?X1  $?X2))))) 
"which  supports  every  pyramid" 


Figure  66  --  Quantifiers 
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the  red  cube  does  not  support".  For  the  robot#  "every"  means 
"every  one  I  know  about".  This  (s  not  a  requirement  of  PLANNER# 
or  even  of  the  way  we  have  set  up  our  semantic  programs.  It  v;as 
done  as  a  convenience#  and  will  be  changed  when  the  system  Is 
expanded  to  discuss  universal  statements  as  well  as  the  specific 
commands  and  questions  It  novr  handles. 

We  similarly  handle  the  v<hole  range  of  quantifiers  and 
types  of  numbers#  using  the  logical  primitives  and  THFIMD 
parameters  of  PLANNER.  The  work  Is  actually  done  In  two  places. 
SMNGl  takes  the  vrords  and  syntactic  features#  and  generates  the 
"determiner"  which  v/as  one  of  the  Ingredients  of  our  semantic 
structure  for  objects.  The  determiner  contains  three  parts. 
First#  the  number  Is  either  NS  (singular#  but  not  with  the 
specific  number  "one")#  MPL  (plural  with  no  specific  number)# 
NS-PL  (ambiguous  between  the  two#  as  In  "the  fish")#  or  a 
construction  containing  an  actual  arithmetic  number.  This  can 
either  be  the  number  alone#  or  a  combination  with  ">"#  "<"# 
"exactly".  Thus  the  tv/o  NGs  "at  most  two  days"  and  "fewer  than 
three  days"  produce  the  Identical  determiner#  containing  "(< 

3)".  The  second  elem'int  of  the  determiner  Is  either  OFF# 

INDEF#  ALL#  NO#  or  NDtT  (no  determiner  at  all  --  as  In  "We  like 
sheep.")  The  third  Is  saved  for  the  question  types  HOWMANY  and 
WHICH#  so  It  Is  NIL  In  a  NG  which  Is  not  a  QUEST  or  REL. 
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Nutnbor 

NS 

an  apple 

NPL 

some  thoughts 

7 

seven  sisters 

(>  2) 

at  least  three  ways 

(<  5) 

fewer  than  five  people 

(EXACTLY  2) 

exactly  two  minutes 

Oe terni ner 

OEF 

the  law 

INDEF 

a  riot 

ALL 

every  child 

NO 

noth i ng 

NDET 

good  intentions 

Question  Marker 

HOWMANY 

how  many  years 

WH  1  CH 

which  road 

Figure  67  -- 

Examples  of  Determiner  Elements 

Other  specialists  such  as  SMRSO  and  the  answering  routines 
use  this  Information  to  produce  PLANNER  expressions  like  the 
ones  described  above.  In  addition#  there  are  special  programs 
for  cases  like  the  OF  NG#  as  In  "all  your  dreams".  In  this 
case#  the  PREP08J  following  "of"  Is  evaluated  as  a  NG  first. 
Therefore  In  "three  of  the  blocks"#  we  analyze  "the  blocks" 
first#  and  since  it  Is  definite#  PLANNER  Is  called  to  find  out 
what  it  refers  to.  It  returns  a  list  of  "the  blocks"#  (e.g. 

(:B1  :B4  :B6  iB7)).  The  OF  specialist  uses  the  PLANNER  function 
THAMONG  (which  chooses  its  variable  bindings  from  "among"  a 
given  list)  to  produce  an  expression  like: 

(THFIND  3  $?X1  (Xl>  (THAMONG  XI  (QU0TE(;B1  :B4  :B6  :B7)))) 


Section  4.2.4  -  Page  323 


Ordinals  are  treated  specially,  along  with  SUPerlatIve 
ADJectIves.  If  we  have  a  MG  like  "the  biggest  block  which 
supports  a  pyramid”.  It  Is  Impossible  for  to  add  the 

meaning  of  "biggest”  to  the  description  In  the  same  way  as  It 
would  add  an  expression  for  "big”.  The  block  Is  "biggest"  with 
respect  to  a  group  cf  objects,  and  that  group  Is  not  fully 
defined  until  the  entire  MG  has  been  parsed.  Including  the 
qualifiers.  SMMGl  therefore  does  a  partial  analysis  of  the 
meaning,  looking  up  the  name  of  the  measure  that  particular 
adjective  refers  to,  then  hangs  the  result  In  the  last  niche  of 
the  OSS  described  In  section  4.2.1  After  all  has  been  parsed, 
SMMG2  finds  It  there  and  creates  a  full  logical  description.  In 
the  case  of  "the  biggest  block  v/hich  supports  a  pyramid",  we 
would  get  the  PLAMMEP.  description: 


((XI  X2  X3  X4  )  200 

(THG0AL(#IS  $?X1  #BL0CK)) 

(THG0AL(#1S  $?X2  ^PYRAMID)) 

(THG0aL(#SUPP0RT  $?X1  5?X2)) 

(THNOT 

(THAND(THGOAL(#IS  $?X3  #BL0CK)) 

(THG0AL(#IS  $?X4  #PYRAMID)) 
(THG0AL(#SUPP0RT  $?X3  $?X4)) 
(THG0AL(#M0RE  #SIZE  $?X3  $?X1))))) 

Figure  68  --  PLANNER  Description  2 
"the  biggest  block  which  supports  a  pyramid" 


A  similar  type  of  description  Is  generated  for  other 


superlatives  and  ordinals 
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4,2.5  The  Mean i ng  of  Quest  ions 

So  far,  we  have  discussed  the  semantics  of  objects  and  the 
relationships  which  are  used  to  describe  them  tn  preposition 
groups  and  relative  clauses.  Now  we  will  deal  with  the  overall 
meaning  of  a  sentence  as  an  utterance  --  as  a  statement,  a 
question,  or  a  command.  The  sentence  is  analyzed  Into  a 
relationship  semantic  structure,  and  the  system  must  act  on  ft 
by  responding,  taking  an  action,  or  storing  some  knowledge. 

First  let  us  look  at  questions.  in  describing  the  grammar 
of  clauses  (see  section  2.3.3)  we  pointed  out  the  similarities 
between  questions  and  relative  clauses,  which  share  a  large  part 
of  the  system  network  and  the  parsing  program.  They  also  have 
much  In  common  on  a  semantic  level.  We  can  look  at  most 
questions  as  being  a  relative  clause  to  some  focus  element  In 
the  sentence. 

In  the  class  of  WH  questions,  this  resemblance  Is  easy  to 
see.  First  we  can  take  a  NGQ  question,  whose  question  element 
Is  a  MG.  The  question  "Which  red  cube  supports  a  pyramid?"  Is 
very  closely  related  to  the  NG  "a  red  cube  which  supports  a 
pyramid.  The  system  can  answer  such  a  question  by  relating  the 
clause  to  the  object,  and  building  a  description  of  "a  red  cube 
which  supports  a  pyramid."  It  then  takes  this  entire  PLANNER 
description  and  puts  It  Into  a  THFIMD  ALL  statement,  which  Is 
evaluated  In  PLANNER.  The  result  Is  a  list  of  objects  fitting 
the  description,  and  Is  In  fact  the  answer  to  om  question.  Of 
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course  PLANNER  might  find  several  objects  or  no  objects  meeting 
the  description.  In  this  case  we  need  answers  like  "none  of 

r> 

them"  or  "two  of  them".  Section  4.4  describes  hew  responses  to 

questions  such  as  these  are  generated,  depending  on  the  relation 

between  the  specific  question  and  the  data  found.  If  the  * 

question  Is  "how  many"  Instead  of  "which",  the  system  goes 

through  the  Identical  process,  but  answers  by  counting  rather 

than  naming  the  objects  found. 

No  matter  what  type  of  NGQ  we  have  (there  Is  a  tremendous 
variety  --  see  section  2.3.3)  the  same  method  works.  We  treat 
the  MAJOR  clause  as  a  relative  clause  to  the  MG  which  Is  the 
question  element,  and  which  we  call  the  focus.  This  Integrates 
the  relationship  intended  by  the  clause  Into  the  description  of 
that  object.  PLANNER  then  finds  all  objects  satisfying  the 
expanded  description,  and  the  results  are  used  to  generate  an 
answer. 

Next,  V'je  have  the  QADJ  questions,  like  "when",  "why",  and 
"how".  In  these  cases  the  focus  Is  on  an  event  rather  than  on 
one  element  of  the  relation.  If  we  ask  "Why  did  you  pick  up  a 
block?",  we  arc  referring  to  an  event  which  was  stored  In  the 
system's  memory  as  (fPICKUP  E23  :B5)  where  ;B5  Is  the  name  of 
the  object  picked  up,  and  E23  Is  the  arbitrary  name  which  was 
assigned  to  the  event  (see  Section  3.4  for  a  description  of  the 
way  such  Information  Is  stored.)  We  can  ask  In  PLANNER; 


4 
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(THFIMD  ALL  $?EVENT  ($?EVENT  $?X) 

(THGOAL(#PICKUP  $?EVENT  $?X)) 

(THG0AL(#IS  S?X  IBLOCK))) 

In  other  words,  "Find  all  of  the  events  In  which  you  picked  up  a 
block."  This  Is  clearly  the  first  thing  which  must  be  done 
before  we  can  answer  "why".  Once  It  has  been  done,  answering  Is 
easy,  since  PLANNER  will  return  as  the  value  of  THFIND  a  list  of 
names  of  such  events.  On  the  property  list  of  an  event  we  find 
the  name  of  the  event  for  which  It  was  called  as  a  subgoal  (the 
"reason").  Vie  need  only  to  describe  this  In  English.  Similarly 
if  the  question  Is  "when",  the  property  list  of  the  event  gives 
its  starting  and  ending  times.  If  the  question  Is  "how"  It 
takes  a  little  more  work,  since  the  subgoal  tree  Is  stored  with 
only  upward  links.  But  by  looking  on  the  EVENTLIST,  the  system 
can  generate  a  list  of  all  those  goals  which  had  as  their  reason 
the  one  mentioned  In  the  sentence. 

This  concept  of  a  relation  as  a  sort  of  object  called  an 
"event"  Is  useful  In  other  parts  of  the  semantics  as  well  --  for 
instance  in  dealing  with  embedded  clauses  as  In  "the  block  which 
I  told  you  to  pick  up".  This  is  described  In  section  4.2.12. 

"V^here"  Is  sometimes  handled  differently,  as  It  may  be 
either  a  constituent  of  a  clause,  such  as  a  location  object 
(LOBJ)  (In  "Where  did  you  put  It?")  or  an  ADJUNCT  (as  In  "VIhere 
did  you  meet  him?").  The  first  case  is  handled  just  like  the  NG 
case,  making  the  clause  a  relative,  as  If  It  were  "the  place 
where  you  put  it",  then  asking  In  PLANNER: 
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(THFIND  ALL  $?PLACE  (PLACE  EVENT) 

(THGOAL  (#PUT  $?EVENT  :OBJ  $?PLACE))) 

The  ADJUNCT  case  Involves  thinking  about  a  special  ^LOCATION 

assertion,  as  In: 

(THFIND  ALL  $?PLACE  (PLACE  EVENT) 

(THGOAL(#MEET  $?EVENT  :YOU  :HIM)) 

(THGOAL(#LOCATION  $?EVEMT  $?PLACE))) 

In  this  example,  we  have  moved  away  from  the  BLOCKS  world  since 

It  does  not  yet  contain  any  actions  In  Its  vocabulary  which 

occur  at  a  specific  place  without  that  place  being  mentioned  In 

the  event,  such  as  #PUT.  However  the  semantic  system  Is 

perfectly  capable  of  handling  such  cases. 

So  far,  we  have  seen  thi,t  we  can  answer  WH-  questions  by 
pretending  they  are  a  relative  to  some  object,  event,  or  place, 
and  by  adding  the  relationship  to  the  description  of  this  focus. 
It  Is  an  Interesting  fact  about  English  that  even  In  a  YES-NO 
question,  where  there  Is  no  question  element  there  Is  usually  a 
focus.  Consider  a  simple  question  like  ’*Doss  the  box  contain  a 

block?"  Someone  might  answer  "Yes,  a  red  one.",  as  If  the 
question  had  been  "VIhIch  block  does  the  box  contain?"  Notice 
that  "Yes,  the  box."  v^ould  not  have  been  an  appropriate  answer. 
Something  about  "the  box"  makes  It  obvious  that  It  Is  not  the 
focus.  It  Is  not  Its  place  as  subject  or  object,  since  "Is  a 
block  In  the  box?"  reverses  these  roles,  but  demands  the  same 
answer.  Clearly  It  Is  the  fact  that  "a  block"  Is  an  INDEFInlte 
NG. 

The  fact  that  a  speaker  says  "a  block"  Instead  of  "the 
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block"  Indlcotes  that  he  Is  not  sure  of  a  specific  object 
referred  to  by  the  description.  Even  If  he  does  not  Inquire 
about  It  specifically/  the  listener  knows  that  the  information 
will  be  new/  and  possibly  of  Interest  since  he  mentioned  the 
object.  In  answering  "Does  the  box  contain  a  block?",  our 
system  does  the  same  thing  It  would  do  with  "How  many  blocks 
does  the  box  contain?".  It  adds  the  relation  "contained  by  the 
box"  to  the  description  of  "a  block",  and  finds  all  of  the 
objects  meeting  this  description.  Of  course  the  verbal  answer 
Is  different  for  the  two  types  of  question.  In  one  case,  "Yes" 
Is  sufficient,  while  in  the  other  "one"  Is.  But  the  loglca 
deduction  needed  to  derive  it  Is  Identical.  In  fact,  our  system 
uses  this  extra  Information  by  replying,  "Yes,  two  of  them;  a 
red  one  and  a  green  one."  This  may  sometimes  be  verbose,  but  ?n 
fact  gives  a  natural  sound  to  the  question-answering.  It  takes 
on  the  "Intelligent"  character  of  telling  the  questioner 
Information  he  would  be  Interested  In  knowing,  even  when  he 
doesn't  ask  for  It  explicitly. 

In  YES-NO  questions.  It  Is  not  always  easy  to  determine  the 
focus.  Only  an  INDEF  NG  which  Is  not  embedded  In  another  NG  can 
be  the  focus,  but  there  may  be  several  of  them  In  a  sentence. 
Sometimes  there  Is  no  way  to  choose,  but  that  Is  rare.  In 
«  asking  a  question,  people  are  usually  focusing  their  attention 

cn  a  particular  object  or  event.  There  are  a  number  of  devices 
for  Indicating  the  focus.  For  example  a  quantifier,  like  "any" 
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or  a  TPRON  like  "something"  emphasizes  the  NG  more  than  a  simple 
determiner  like  "a".  In  both  "Does  anything  green  support  a 
block?",  and  "Does  a  block  support  anything  green?",  the  phrase 
"anything  green"  Is  the  focus.  When  none  of  these  cues  are 
present,  the  syntactic  function  of  the  NG  makes  a  difference. 

If  we  ask  "Is  there  a  block  on  a  table",  then  "block"  Is  the 
focus,  since  It  Is  the  subject  while  "table"  Is  Inside  a  PREPG. 
Our  system  contains  a  heuristic  program  which  takes  Into  account 
the  kind  of  determiners,  number  features  (singular  Is  more 
likely  than  plural),  syntactic  position,  and  other  such  factors 
In  choosing  a  focus.  If  It  Is  In  fact  very  difficult  to  choose 
In  a  given  case,  it  Is  likely  that  the  speaker  will  be  satisfied 
wl th  ary  choice. 

For  sentences  In  the  past  tense,  which  contain  no  focus  MG, 
we  can  again  have  an  event  as  a  focus.  If  we  ask,  "Old  Jesse 
James  rob  the  stagecoach?",  a  possible  answer.  Interpreting  the 
event  as  the  focus.  Is  "Yes,  three  times:  yesterday,  last  week, 
and  a  year  ago."  This  Is  closely  parallel  to  answering  questions 
In  which  the  focus  Is  an  object. 

There  are  some  questions  which  have  no  focus,  such  as 
present-tense  clauses  with  only  definite  noun  groups.  These, 
however,  are  even  easier  to  answer,  since  they  can  be  expresssed 
In  the  form  of  a  simple  set  of  assertions  with  no  variables. 

The  NG  analysis  finds  the  actual  objects  referred  to  by  a 
definite  MG,  and  these  are  used  In  place  of  the  variable  In 
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relationships.  We  cin  therefore  answer  "yes*'  or  "no"  by  making 
a  goal  of  the  relationship  and  letting  PLANNER  evaluate  It.  The 
question  "Dees  the  red  cube  support  the  box?"  would  generate  the 
♦  simple  PLANNER  expression 

(THGOAL  (ISUPPORT  ;B3  :B0X)) 

if  :B3  Is  the  Internal  name  for  the  red  cube.  PLANNER  would 
return  a  non-NIL  value  only  If  the  answer  were  "yes". 


¥ 
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The  system  can  accept  commands  In  the  form  of  IMPERATIVE 


sentences.  These  are  handled  somewhat  differently  from 
questions.  If  they  contain  only  definite  objects,  they  can  be 
treated  In  the  way  mentioned  above  for  questions  with  no  focus. 
The  command  "Pick  up  the  red  ball.".  Is  translated  Into  the 
relationship  (#PICKUP  ;B7)  which  can  be  evaluated  directly  by 
putting  It  In  a  THGOAL  statement  which  will  carry  out  the 
act  Ion: 


(THGOAL  (#PICKUP  ;B7)(THUSE  TC-PICKUP)) 

However,  If  we  say  "Pick  up  a,  red  ball.",  the  situation  Is 
different.  We  could  first  use  THFIND  to  find  a  red  ball,  then 
out  this  object  In  a  simple  goal  statment  as  've  did  with  "the 
red  ball".  This,  however,  might  be  a  bad  Idea.  In  choosing  a 
red  ball  arbitrarily,  we  may  choose  one  which  Is  out  of  reach  or 
which  is  suppoi*tlng  a  tower.  The  robot  might  fall  or  be  forced 
to  do  a  lot  of  work  which  It  could  have  avoided  with  a  little 
thought. 

We  want  to  send  the  theorem  which  works  on  the  goal  a 
description  rather  than  an  object  name,  and  let  the  theorem 
choose  the  specific  object  to  be  used,  according  to  the  criteria 
which  best  suit  It.  This  Is  the  method  we  have  adopted. 

Remember  that  each  OSS  has  a  name  like  "NG45".  Before  a  clause 
Is  related  to  Its  objects,  these  are  the  symbols  used  In  the 
rel at lonsh I p. 
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When  we  analyze  "Pick  up  a  red  ball",  it  will  actually 
produce  (^PICKUP  NG45),  where  NGUS  names  an  OSS  describing  "a 
red  ball."  We  use  this  directly  as  a  goal  statement,  calling  a 
special  theorem  which  knows  how  to  use  these  descriptions.  The 
theorem  calls  a  theorem  named  TC-F INDCHOOSE,  which  uses  the 
description  of  the  object,  along  with  a  set  of  "desirable 
properties"  associated  with  objects  used  for  trying  to  achieve 
the  goal.  #PICKIP  may  specify  that  it  would  prefer  picking  up 
something  which  doesn't  support  anything,  or  which  is  near  the 
hand's  current  location.  Each  theorem  can  ask  for  whatever  it 
wants.  Of  course.  It  may  be  Impossible  to  find  an  object  which 
fits  all  of  the  requirements,  and  the  theorem  has  to  be 
satisfied  with  what  it  can  get.  TC-F I NDCHOOSE  tries  to  meet  the 
full  specifications  first,  but  If  it  can't  find  an  object  (or 
enough  objects  In  the  case  of  plural),  it  gradually  removes  the 
restrictions.  It  must  always  keep  the  full  requirements  of  the 
description  input  In  English  In  order  to  carry  out  the  specified 
command.  The  robot  simply  tries  to  be  clever  about  choosing 
those  objects  which  fit  the  command  but  are  also  the  easiest  for 


J  t  to  use 
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In  addition  to  questions  and  commands,  the  system  can 


accept  declarative  sentences.  We  have  Intentionally  not 
emphasized  them,  as  there  a*’e  theoretical  problems  and  dan;;ers 
In  designing  a  program  to  accept  Information  In  this  way.  In 
Chapter  3,  we  discussed  the  complex  world-model  a  person  has  and 
explained  why  we  felt  that  Intelligence  needed  a  highly 
structured  and  coordinated  body  of  knowledge  rather  than  a  sot 
of  separate  uniform  facts  or  axioms.  It  Is  comparatively  easy 
to  get  a  program  to  add  new  Information  of  the  second  type,  but 
very  difficult  to  get  It  to  add  the  first,  since  this  Involves 
understanding  the  relationship  betv/een  the  new  Information  and 
whatever  Is  already  there. 


Therefore,  although  we  have  Included  declarative  sentences 
In  our  dialog  (and  they  are  fully  handled  In  the  grammar),  we 
believe  that  before  trying  to  "tell"  many  things  to  a  program, 
we  need  to  have  a  better  Idea  of  how  knov/ledge  should  be 
structured,  and  the  program  should  approach  new  Information  as  a 
problem  solving  activity  rather  than  a  clerical  one. 

When  a  human  sees  a  new  sentance,  he  does  rot  simply  store 
It  away,  but  he  relates  It  to  what  he  already  knows,  perhaps 
changing  his  "programs",  or  Ignoring  the  content  of  the  sentence 
and  Interpreting  something  about  the  person  who  said  It.  A 
language  understander  needs  to  have  an  Interpreter  which  looks 
at  each  r  w  sentence  and  decides  how  to  use  It.  This  may 
Include  checking  It  for  consistency  with  what  It  already  knows, 
creating  new  data  or  types  of  data  In  Its  storage,  modifying 
theorems,  and  many  ©the-  possibilities.  This  Is  discussed 
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further  In  Section  5.1 

In  our  system  we  have  four  different  ways  In  which 
Information  can  be  accepted  In  a  declarative  sentence.  The 
first  Is  a  simple  word  definition  facility.  If  we  say  "A  'marb' 
Is  a  red  block  which  Is  behind  a  box.”#  the  system  recognizes 
that  we  are  defining  a  new  word.  It  currently  recognizes  this 
by  the  quote  marks,  but  It  could  just  as  easily  declare  all 
unfamiliar  words  as  possible  new  words.  We  have  not  done  this 
as  It  would  eliminate  the  feature  that  the  system  Immediately 
recognizes  typing  errors  without  waiting  to  begin  parsing  the 
sentence. 

In  this  kind  of  definition,  the  complement  of  the  sentence 
is  a  noun  group,  which  has  an  OSS.  We  save  this  OSS  and 
generate  a  new  dictionary  entry  for  the  word,  defined 
syntactically  as  a  noun,  and  .vith  Its  semantic  definition  being 
the  program  "set  the  object  description  to  this  one  we  saved 
earlier,"  Remember  that  all  definitions  are  programs,  so  this 
one  fits  In  with  no  problem.  When  It  Is  called  on  to  build  part 
of  the  descrlptic'.  it  simply  Inserts  the  description  used  to 
define  It.  I f  we  talk  about  "two  big  marbs",  the  system  will 
build  a  description  exactly  like  the  one  for  "tv;o  big  red  blocks 
which  are  behind  a  box." 

The  second  kind  of  information  the  system  accepts  is  simple 
assertions  involving  a  predicate  for  which  It  does  not  have 
complete  knowledge.  As  we  mentioned  In  Section  3.4,  the  system 
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hps  complete  data  about  the  physical  character  I s 1 1 cs  of  the 
objects  In  the  scene.  He  have  selected  #LIKE  as  an  arbitrary 
relation  about  which  the  system  knows  nothing  except  what  It  Is 
told  In  the  dialoc.  If  we  say  "I  like  ycu.”  this  produces  the 
assei'tlcn  (#LIKE  tFRlEHO  tSHRDLU)  (the  name  of  the  robot  Is 
:SHR0LU)  which  Is  simply  added  to  the  data  base.  The  system 
also  plays  a  trick  with  the  adjective  "nice".  Instead  of  having 
some  concept  of  fNICE#  It  assumes  that  the  use  of  "nice"  In 
describing  something  Is  really  saying  more  about  the  speaker 
than  the  object/  so  the  definition  of  "nice"  Is 
(nMEAMS((#THIK'G)((<'LiKE  :  FRIEND  ***)))) 

In  other  wordS/  the  person  who  uses  the  word  "nice"  likes  the 
object  he  was  referring  to. 

If  we  use  an  object  which  Isn't  definite,  as  In  "I  like  red 
blocks."/  the  system  uses  the  object  description  to  generate  a 
simple  PLANNER  consequent  theorem.  It  creates  a  theorem  of  the 
form: 

(THCONSE  (XI) 

(#LIKE  :FRIEND  $?X1) 

(THGOAL  (#IS  $?X1  #BLOCK)) 

(THGOAL  (#COLOR  $?X1  #RED))) 

This  theorem  says  "Whenever  you  want  to  prove  that  the  user 
likes  something,  you  can  do  It  by  proving  that  It  Is  a  block  and 
it  Is  red."  This  Is  added  to  the  theorem  data  base,  and  can  be 
used  to  answer  questions  or  carry  out  deductions  Involving 
objects  described  as  "nice".  The  system  does  not  separate  types 
of  non-definite  objects  and  assumes  universal  quantification. 
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The  results  would  have  been  the  same  If  the  sentence  used  "any 
red  block",  "every  red  block",  "all  red  blocks",  or  (wrongly)  "a 
red  block."  A  more  complete  treatment  Is  one  of  the  possible 
extensions  of  the  system. 

It  does  notice  the  form  "no  red  blocks"  and  uses  this  for 
the  fourth  kind  of  Information.  It  sets  up  an  almost  Identical 
theorem,  but  with  a  "kicker"  at  the  end.  If  we  say  "I  like  no 
red  blocks.".  It  sets  up  the  theorem: 

(THCONSE  (Xn 
(/LIKE  :FRIEND  S?X1) 

(TMGOAL  (#IS  $7X1  /BLOCK)) 

(THGOAL  (/COLOR  S7X1  /RED))) 

(THFAIL  THGOAL)) 

When  the  system  Is  trying  to  prove  that  we  like  something, 
this  theorem  is  called  just  like  the  jne  above.  But  this  time, 
after  it  find*^  out  that  the  object  is  a  red  block.  It  does  not 
succeed.  Instead^  it  uses  the  PLANNER  function  THFAIL  In  a 
powerful  way.  Instead  of  just  causing  that  theorem  to  fall.  It 
causes  the  entire  goal  to  fall,  regardless  of  what  other 
theorems  theie  are.  We  can  also  accept  a  sentence  like  this 
with  a  positive  NG  but  a  negative  clause,  as  In  ’’ i  don't  like 
the  red  block"  or  "I  don't  like  any  red  blocks." 


wx»r4 
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IlJLuS  Ili2£ 

One  of  the  most  complex  parts  of  En;t1lsh  semantics  Is  the 
way  of  establishing  temporal  relationships.  It  v/as  pointed  out 
earlier  that  one  of  the  primary  differences  between  the  clause 
and  other  ur.lts  such  as  the  NG  or  PREPG  Is  the  special  set  of 
mechanisms  within  the  clause  for  handling  tlm“.  In  this  section 
we  will  describe  how  those  mechanisms  operate  both  within  the 
clause  and  at  other  levels  of  syntax. 

In  our  formalism  for  describing  relations  and  events  (see 
section  5.1)  there  Is  provision  for  Including  a  time  reference 
In  a  relation.  The  sentence  "Harriet  saw  the  film  last  week." 
might  be  represented  as 

(#SEE  :HARRIET  :FILM  :riME23) 

where  ;TIME23  Is  an  arbitrary  name  for  a  structure  describing 
the  time  reference  "last  week".  The  semantic  programs  for 
dealing  with  time  con  be  described  In  three  parts  --  the  form  of 
structures  used  to  represent  time/  the  way  those  structures  are 
created/  and  the  way  they  are  used  tr:  understanding  and 
deduction. 

A.  Time  Semantic  Structures 

For  the  purposes  of  our  BLOCKS  world/  we  have  treated  only 
a  simple  part  of  the  overall  range  of  time  references  in 
English.  In  particular  we  have  dealt  only  with  references  to 
actual  events  which  have  happened  In  the  past  or  are  occurring 
In  the  present/  without  dealing  with  the  many  varieties  of 
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future  events,  possible  events,  conditional  events,  etc.  With 
this  simplification  the  system  can  use  a  simple  linear  time 
scale  (like  a  clock),  relating  all  events  to  specific  numerical 
times.  This  does  not  mean  that  a  single  event  must  occur  at  a 
single  time  •“  it  may  continue  for  a  period  of  time  during  which 
other  events  are  occurring. 

Sngllsh  makes  a  clear  distinction  between  events  which  are 
thought  of  as  occurring  at  a  particular  time,  and  those  which 
are  pictured  as  continuing  over  an  Interval.  This  contrast  Is 
expressed  both  In  the  choice  of  verbs  and  in  the  shape  of  the  VG 
containing  the  verb. 

Verbs  like  "like",  and  "know",  are  Inherently  progressive. 
They  express  a  relationship  which  continues  over  a  period  of 
time.  Verbs  like  "hit",  and  "write"  are  not  progressive,  but 
indicate  the  completion  of  an  action  as  a  whole.  Of  course, 
this  action  also  Involves  a  process,  and  there  Is  a  way  to 
express  this  aspect  by  using  a  tense  PRESENT  IN...  The  sentence 
"i  broke  it."  Is  not  progressive,  giving  the  feeling  of  a  single 
momentary  act.  "I  was  breaking  It."  emphasizes  the  process  of 
breaking,  to  v/hlch  other  events  can  be  related. 

In  the  present  tense,  the  distinction  Is  clear.  The 
present  of  a  progressive  verb  has  the  expected  meaning,  as  In  "I 
know  your  name."  With  a  non-progressive  verb,  there  Is  a  special 
meaning  of  habitual  or  repeated  action,  as  In  "I  break  bottles." 
In  order  to  produce  the  meaning  usually  considered  "present". 
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the  verb  group  must  be  PfJESEfIT  Ifl  PRESENT,  as  In  ”1  am  breaking 
bottles." 

Ambiguities  can  arise  from  verbs  which  are  both  progressive 
and  non-progressive.  The  question  "Did  the  red  block  touch  the 
green  one  while  you  were  building  the  stack?"  has  two 
Interpretations.  One  means  "Was  It  In  contact  during  that 
time?",  while  the  other  asks  "Old  It  make  contact  during  that 
time?"  If  the  verb  were  replaced  by  "support",  only  the  analog 
of  the  first  meaning  would  be  valid,  while  "hit"  would  Involve 
the  second.  The  representation  for  time  references  must  take 
this  progress  I V I ty  into  account  In  trying  to  Interpret  time 
modifiers. 

The  representation  used  for  time  has  four  elements;  the 
tense,  an  Indicator  for  progressive,  a  starting  time  limit,  and 
and  ending  time  limit.  Either  or  both  of  the  limits  may  be 
ommitted.  Some  examples  of  sentences  and  thel’"  corresponding 
structures  are  shown  In  Figure  69. 


A  supports  B 

(PRES) 

T 

;MOW  ;NOW 

A  supported  B  before  time  23 

(PAST) 

T 

NIL  23 

A  hit  B  before  time  23 

(PAST) 

NIL 

MIL  23 

You  built  It  after  time  24 

(PAST) 

NIL 

24  NIL 

You  were  building  It  after  time  24 

(PAST) 

T 

24  NIL 

Figure  69  --  Time  Semantic 

Structures 

The  difference  between  the  last  two  examples  In  Figure  69 
can  be  visualized  by  drawing  a  time  line: 
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Non-Prop.ress  I  ve 

You  built  It  after  time  24 

"■"<  )— . 

time  begin 
24  building 

Progress  I ve 

You  were  building  It  after  time  24 
■  '■  — ■'  I 

- !  finish 

time  building 

24 

Figure  70  --  Progressive  and  non-Progress I ve  Times 
A  non-progressive  action  must  begin  after  the  start  time, 
and  end  before  the  end  time.  A  progressive  one  begins  before 
the  start  time  and  ends  after  the  end  time.  The  TSS  ■'or  "you 
hit  It  during  event  23"  (assuming  event  23  began  at  time  3  and 
ended  at  7}  would  be 
(PAST)  NIL  3  7 

I.e,  the  hit  began  after  event  23  started  and  ended  before  It 
ended.  The  sentence  "you  were  hitting  It  during  event  23"  would 
be: 

(PAST)  T  7  3 

I.e.  the  hitting  began  before  event  23  was  over,  but  ended  after 
It  had  begun.  This  covers  all  ways  of  having  the  two  events 
overlap.  The  definitions  of  the  relating  words  like  "during" 
and  "bafore"  do  not  have  explicit  mention  of  this  distinction, 
but  the  semantic  analysts  programs  take  Into  account  whether  the 
verb  and  VG  are  progressive  In  setting  up  the  TSS. 
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B,  Setting  up  Time  Structures 

Tlrr*e  Semantic  Structures  are  associated  with  clauses^  and  a 
new  one  Is  generated  each  time  a  clause  Is  parsed.  Its  elements 
are  determined  by  different  aspects  of  the  clause  structure  -- 
the  tense  depends  on  the  form  of  the  VG,  the  progress  I vl ty 
depends  on  both  the  tense  and  the  specific  verb^  ani  the  limits 
are  set  by  modifiers  such  as  bound  clauses^  adverbS/  and  time 
NGs  as  well  as  by  the  tense. 

No  analysts  Is  done  until  after  the  VG  Is  parsed  and  the 
tense  established.  Some  types  of  secondary  clauses  such  as  IMG/ 
SUBING/  TO/  and  SUBTO  do  not  Indicate  a  tense.  There  Is  a 
potential  ambiguity  In  determining  the  time  reference.  **The  man 
sitting  on  the  table  baked  the  bread.”  might  Indicate  that  the 
man  was  sitting  on  the  table  when  he  baked  It/  or  that  he  Is 
sitting  on  the  table  now. 

Unless  there  Is  a  specific  reference  (like  ”the  man  sitting 
on  the  table  yesterday...”)  the  system  should  take  both 
possibilities  into  account  and  resolve  them  as  it  would  an 
ambiguity  cau5;ed  by  multiple  senses  of  words.  The  current 
system  does  not  do  thIS/  but  uses  a  simplifying  heuristic.  If 
the  secondary  clause  Involves  PAST/  and  Is  embedded  In  a  PAST 
MAJOR  CLAUSE/  the  two  times  are  assumed  the  same  unless 
specifically  mentioned.  If  the  secondary  clause  has  no  tense/ 
it  Is  assumed  PRESENT.  If  It  Is  PAST/  but  Imbedded  In  a  PRESENT 
MAJOR  CLAUSE/  the  system  checks  the  time  reference  of  the 
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previous  sentence.  If  this  Is  PAST,  the  new  one  Is  assuned  to 
be  the  sane  (Includine  whatever  modifiers,  limits,  etc. 
applied).  lf  not  It  sets  up  a  general  time  structure  for  PAST, 
with  no  beginning  limit,  and  an  end  limit  of  :N0'.l.  A  PRESfiMT 
tense  TSS  is  represented  by  the  sing.e  itom  rMOVI,  which  Is 
treated  specially  by  the  programs,  and  Is  often  deleted  from 
relations  which  Interrogate  the  current  state  of  the  data  base  , 
(see  below).  It  can  be  applied  only  to  progressive  verbs  and 
tenses  (no  provision  exists  for  understanding  habitual  action). 

Modals  are  treated  like  present  tense  as  far  as 
establishing  tim.e  references.  A  more  complete  system  v/ould 
account  for  future,  different  types  of  modals,  more  complex 
tenses,  and  would  involve  heuristics  for  finding  the  referents 
of  multiple  tenses  like  "He  v\»i  1 1  have  been  going  to  go 
immediately  for  a  month  by  Tuesday." 

The  start  and  end  limits  are  set  by  modifier;.  Adverbs 
like  "yesterday"  and  TIME  NG's  like  "the  week  he  arrived"  set 
both  limits.  This  can  also  be  done  by  bound  clauses  like  "while 
you  were  building  the  stack"  or  PREPGs  like  "during  the  flood". 
Other  clauses,  prepositions,  and  groups  set  only  the  start  limit 
(like  "after  you  hit  It",  "after  the  war")  while  others  (Ilk-’ 
"before"  and  "until")  set  the  end  limit.  In  the  current  system 
the  event  being  referred  to  In  the  modifier  Is  assumed  to  be 
known  along  with  Its  exact  time  (It  must  be  In  the  past.)  The 
exact  beginning  and  ending  time  are  used  In  setting  the  limits. 
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The  question  "Did  you  pick  it  up  v/hlle  you  v/ere  building 
the  stack?"  is  answered  by  first  finding  the  event  of  bu i 1 d i r g 
the  stack  (using  a  rss  for  PAST  tense  with  no  other  limits)# 
then  using  the  beginnint  and  ending  of  that  event  as  limits  for 
the  TSS  in  the  relation  #PICKUP. 

There  ere  discourse  phenomena  which  Involve  time  referenc?. 
First#  there  are  specific  back-references  with  words  like  "then" 
and  phrases  like  "at  that  time".  The  system  keeps  track  of  the 
major  time  reference  of  the  previous  sentence#  and  substitutes 
it  in  the  current  sentence  whenever  such  phrases  are  used.  This 
time  is  also  carried  forward  Implicitly.  Consider  "Did  you  pick 
up  a  red  clock  while  you  were  building  the  tower?"  "No."  "Did 
you  pick  up  a  green  one?"  In  this  sequence#  the  second  question 
involves  a  specific  time  interval  although  It  is  not  mentioned 
again.  V/henever  there  are  tv/o  successive  PAST  sentences  and  the 
second  does  not  have  any  explicit  time  reference#  the  previous 
TSS  is  used.  Long  dialogs  can  appear  in  which  the  same  time 
Interval  Is  used  throughout#  but  Is  mentioned  only  in  the  first 
sentence. 

C.  Use  of  TSS 

So  far,  all  of  our  discussion  has  Involved  the  clause  with 
its  verb  group  and  time  modifiers.  But  in  making  use  of  time 
Information  we  must  handle  other  units  as  well.  The  sentence 
"The  man  sitting  v>r.  the  table  baked  the  bread."  has  two 
meanings#  but  the  point  would  have  been  Identical  for  "Tie  man 
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on  the  table  baked  the  bread."  The  qualifying  prepos I t ional 
phrase  "on  the  table"  does  not  refer  to  time#  but  can  be 
Interpreted  either  as  meaning  "on  th?  table  now"  or  "on  the 
table  then".  Adjectives  can  be  affected  similarly.  Consider 
the  sentences! 

a.  Many  rich  men  made  their  fortunes  during  the  depression. 

b.  Many  rich  men  lost  their  fortunes  during  the  depression. 

c.  Many  rich  men  worked  In  restaurants  during  the  depression. 

The  first  clearly  means  "men  wno  are  now  rich"#  the  second 

"men  who  were  rich"#  and  the  third  might  have  either 
Interpretation.  The  adjective  "rich"  Involves  an  Implicit  time 
reference#  as  does  any  adjective  which  describes  a  state  v/hlch 
can  be  true  of  an  object  at  one  time#  but  false  at  another. 

Nouns  can  also  Involve  states  which  are  changeable#  and  the 
problem  would  be  Identical  If  "rich  men"  were  replaced  by 
"ml  1 1  Iona  I  res". 

In  a  traditional  transformational  approach#  this  would  be 
used  to  show  that  even  a  simple  phrase  such  as  "a  rich  man"  or 
"millionaires"  Is  generated  by  a  series  of  transformations.  The 
possibility  of  two  meanings  Is  accounted  for  by  two  different 
deep  structures#  Involving  sentences  corresponding  to  "The  men 
were  rich."  and  "The  men  are  rich."  This  leads  to  a  syntactic 
theory  In  which  the  simplest  sentence  may  Involve  dozens  of  such 
transformations#  to  account  for  each  noun#  adjective# 
prespos 1 t Ion#  etc.  The  parser  must  be  able  to  handle  all  of 
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these  details  using  syntactic  information. 

in  our  aoproach/  these  can  be  seen  as  semantic  ambiguities 
which  arise  within  a  single  syntactic  structure.  Part  of  the 
semantic  definition  of  the  word  "millionaire"  (or  "student"/ 
"bachelor"/  etc.)  Involves  a  reference  to  time.  Within  the 
language  for  writing  semantic  definitions/  there  Is  a  special 
symbol  *TIME.  Whenever  the  program  for  the  meaning  of  a  word  In 
the  dictionary  Is  called/  the  semantic  system  will  have 
determined  the  appropriate  Time  Semantic  Structure  (or 
structures)  and  have  assigned  a  value  to  this  symbol 
accordingly.  If  the  time  reference  Is  amblguou**,  the  definition 
will  be  called  once  for  each  possibility.  The  noun  millionaire" 
might  be  defined: 

(NMEANS  ((#PERSON)  ((#IS  ***  #PERSON) 

(♦POSSESS  $1/000/000  *TIME)))) 

Notice  that  not  every  relation  Involves  time.  Being  a 

♦PERSON  Is  assumed  to  be  a  permanent  characteristic.  If  the 

time  Is  PRESENT  (Indicated  by  the  TSS  :N0VI)/  the  system  deletes 

the  time  reference/  so  PLANNER  will  receive  the  expression 

(THGOAL  (♦POSSESS  $?X1  $1/000/000))/  where  $?X1  Is  the  variable 

assigned  to  the  object  being  described.  If  the  sentence  were 

"During  the  war/  many  millionaires  worked  In  restaurants."/  the 

time  reference  of  the  sentence  would  be  a  structure  like  ( 

(PAST)  NIL  1941  1945)/  and  the  PLANNER  expression  for 

"millionaire"  would  Include: 

(♦POSSESS  $7X1  $1/000/000  ((PAST)  NiL  1941  1945)) 
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A  different  theorem  would  be  used  for  this  case,  since  It  cannot 
look  directly  Into  the  data  base  to  see  what  the  person  has,  but 
must  look  Into  Its  past  "records"  to  reconstruct  the 
Information.  In  our  programs,  a  record  Is  kept  of  when  and 
where  objects  have  been  moved,  so  theorems  can  determine  the 
location  of  any  object  at  any  time  In  the  past. 

Since  adjectives  can  be  defined  with  NMEANS,  they  are 
treated  Identically.  PREPosItlons  and  verbs  are  usually  defined 
with  CMEANS,  which  has  the  same  conventions.  The  symbol  *TIME 
can  appear  In  the  PLANNER  description  In  the  definition,  and  Is 
deleted  If  the  applicable  time  Is  jNOW,  and  replaced  with  the 
TSS  otherwise.  The  time  applicable  to  anything  but  a  clause  Is 
that  of  the  clause  closest  above  It  In  the  parsing  tree.  This 
is  only  an  approximation,  and  does  not  take  Into  account 
ambiguities  such  as  Illustrated  In  sentence  c.  above.  In  fact, 
a  PREP  or  NG  can  have  Its  own  time  reference,  as  In  "f.  former 
millionaire",  "many  future  students",  "my  roommate  last  year", 
"the  man  on  the  table  yesterday".  This  Is  one  of  many  places 
where  the  current  semantic  system  needs  to  me  extended  by  making 
the  analysis  more  general.  It  seems  that  this  could  be  done 


within  the  framework  of  the  current  system 
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4- 2 . 9  Semantics  af  Conj unction 

The  semantic  system  does  not  handle  conjunction  as 
generally  as  does  the  parser.  A  few  cases  have  been  dealt  with 
In  a  simplified  way  —  noun  groups,  adjectives,  RSQ  clauses,  and 
MAJOR  clauses  which  are  not  questions.  The  distinction  between 
“and”  and  “but"  Is  Ignored. 

With  MAJOR  clauses,  the  conjunction  must  be  "and",  and  the 
components  are  processed  as  If  they  were  completely  separate 
sentences,  except  that  the  response  ("OK."  for  IMPERatIves,  and 
"I  UNDERSTAND."  for  DECLARatl ves)  Is  supressed  for  all  but  the 
last.  The  system  will  not  accept  sentences  joined  with  "or", 
or  "nor",  and  will  misunderstand  compounds  which  cannot  be 
separated  Into  individual  actions  (e.g.  "Build  a  stack  and  use 
three  cubes  In  It.") 

Noun  groups  can  be  connected  v/Ith  "and"  wherever  they 
appear,  and  with  "or"  If  they  are  part  of  an  argument  to  a 
command  (like  "Pick  up  a  cube  a  pyramid.").  An  OSS  Is  built 
with  the  semantic  markers  of  the  first  constICutent  NG,  the 
conjunction  Itself,  and  a  list  of  the  OSS  for  the  components. 

If  all  of  the  components  are  DEfIntte  and  the  conjunction  Is 
"and",  the  conjoined  NQ  Is  definite,  and  Its  REFERent  Is  the 
union  of  the  referents. 

The  use  of  the  conjoined  OSS  depends  on  Its  place  In  the 
sentence.  If  It  Is  the  object  or  subject  of  a  verb  or 
preposition,  the  definition  of  that  verb  or  preposition  can 
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check  explicitly  for  conjoined  structures  and  treat  them 
specially.  For  example,  "touch"  can  be  defined  so  that  the 
sentence  "A  and  B  are  touching."  will  be  represented  as  (THGOAL 
(#TOUCH  :A  :B)).  If  there  Is  no  special  check,  the  system 
assumes  that  the  desired  object  Is  the  list  of  referents.  "A 
and  8  support  C."  would  produce  (THGOAL  (#SUPP0RT  (;A  :8)  :C)). 
If  the  first  element  of  the  PLANNER  expression  (usually  the  name 
of  a  predicate)  has  a  property  MULTIPLE  on  Its  property  list, 
the  system  modifies  this  to  create  the  expression: 

(THAND(THGOAL(#SUPPORT  :A  :C)) 

(THG0AL(#2UPP0RT  ;B  :C))) 

If  the  conjoined  NG  Is  one  of  the  arguments  to  a  command, 
the  theorem  TC-CHOOSE  will  choose  the  specific  referents.  If 
the  conjunction  Is  "and".  It  will  combine  the  referents  for  each 
of  the  components  In  a  single  list.  If  It  Is  "or".  It  will 
first  choose  according  to  the  first  constituent,  then  If  a 
failure  backs  up  to  the  choice.  It  will  try  the  second,  third, 
etc.  It  does  not  lock  at  the  various  choices  In  advance  to 
decide  which  Is  most  appropriate  for  the  task  being  done. 

The  other  units  which  can  be  combined  with  "and"  and  "or" 
are  the  adjective  and  RSQ  clause.  The  semantic  structure  for 
the  conjoined  unit  Is  a  list  whose  first  element  Is  the 
conjunction,  and  the  rest  are  the  Individual  Interpretations  for 
the  constituents.  In  using  these  to  modify  an  OSS,  the  system 
combines  all  of  the  descriptions  with  THOR  or  Implicit  THAND. 

For  example,  "a  block  which  Is  In  the  box  and  Is  red"  becomes: 
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(THGOAL(#!S  $?X  #BLOCK)) 

<THG0AL(#IN  $?X  ;BOX)) 

(THGOAL(#COLOR  $?X  #RED)) 

while  "a  red  green  block"  becomes; 

(THGOAL(#lS  $?X  #BLOCK)) 

(TH0R(THG0AL(#C0L0R  $?X  #RED)) 

(THG0AL<#C0L0R  $’X  #GREEN))) 

This  coiild  easily  be  extended  to  other  modifiers  such  as 
preposition  groups.  Many  other  types  of  conjunction  could  be 
handled  without  major  changes  to  the  system,  usually  by  adding 
two  bits  of  program.  One  would  create  a  conjoined  semantic 
structure  appropriate  to  the  unit,  and  the  other  would  recognize 
It  and  take  the  approprlata  action  for  Its  use. 

Whenever  the  constituents  of  a  conjoined  structure  are 
ambiguous,  the  resultant  structure  simply  multiplies  the 
ambiguity,  taking  all  possible  combinations  of  Interpretations. 
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an  ambiguous  sentence  can  be  reduced  through  the  use  of  semantic 
markers,  and  selection  restrictions  associated  with  verbs, 
adjectives,  and  prepositions.  This  section  will  describe  the 
mechanism  for  producing  multiple  Interpretations,  assigning 
plausibilities  to  them,  and  resolving  the  ambiguities  through 
discourse  heuristics  and  Interaction  v/lth  the  user, 


Any  word  of  the  classes  ADV,  ADJ,  MOUH,  PREP,  PRON,  PROPN, 
VB,  CLASP,  or  PRT  can  Introduce  an  ambiguity  Into  the  semantic 
Interpretation.  The  remaining  classes  (such  as  NUMber  and 
DETermlner)  have  very  limited  definitions,  and  are  handled 
dl f f erentl y . 

In  general,  a  word  Is  expected  to  produce  a  list  of 
semantic  structures,  based  on  Its  definition,  and  the  other 
lists  of  semantic  structures  to  which  It  Is  related.  NOUH, 

PRON,  and  PROPN  set  up  lists  of  Object  Semantic  Structures.  ADJ, 
ADV,  and  CLASP  take  one  of  these  lists,  and  produce  a  new  list 
adding  the  modification  (and  possiblly  eliminating  anomolous 
combinations).  The  VS,  PREP,  and  PRT  (In  conjunction  with  VB) 
set  up  lists  of  Relation  Semantic  Structures,  and  other  classes 
can  modify  these  lists. 

Any  of  these  definitions  can  Involve  special  programs  for 
producing  the  list  of  structures.  For  example  the  SMIT  program 
Is  used  for  analyzing  pronouns  like  "they"  and  "It".  it 
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contains  a  complex  set  of  heuristics  and  syntactic  criteria  to 
find  the  possible  referents  of  a  pronoun  and  set  up  an 
interpretation  for  each  one.  For  simpler  words,  the  functions 
NMEANS  and  CMEANS  have  ways  to  deal  with  multiple  senses  of  a 
word. 

First,  they  both  take  as  an  argument  not  a  single 
definition,  but  a  list  of  def  Inl  •’Ions,  each  with  Its  own 
semantic  markers,  PLANNER  expressions,  etc.  Second,  each  of 
them  has  mechanisms  for  looping  through  each  of  the  relevant 
"Input"  lists,  to  produce  multiple  Interpretations.  If  the 
subject  and  object  of  a  clause  each  have  two  Interpretations, 
and  the  verb  has  three  senses,  all  twelve  combinations  will  be 
tried,  and  the  resulting  list  of  Interpretations  for  the  clause 
will  contain  as  many  of  them  as  pass  through  the  semantic  marker 
’filters".  This  Is  all  done  by  the  function  CMEANS  without 
mention  In  the  definitions.  Third,  the  definition  functions 
have  optional  ways  to  establish  a  "plausibility"  rating  and 
paraphrase  for  each  meaning. 

In  the  dictionary,  the  word  "on"  has  a  semantic  definition 
which  uses  the  function  #0N,  which  contains: 

(CMEANS  ((((#PHYS0B))  <(#PHYSOB)>>  (#0N  #1  #2  *TIME) 

NIL 

(200  (ANWHERE  ON  TOP  01-))) 
((((fPHYSOB))  ((#PHYS0B)))  (#SUPP0RT  !?2  #1  •TIME) 

NIL 

(0  (DIRECTLY  ON  THE  SURFACE))) 
((((#PLACE))  ((#PHYS08)))  (#0N  #1  #2)  NIL)) 
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This  contains  three  different  senses.  The  third  definition 
will  never  conflict  with  the  first  two,  since  It  Insists  that 
the  subject  be  a  place  rather  than  a  physical  object.  The  first 
two,  however  can  be  applied  to  the  same  objects,  and  therefore 
can  be  Involved  In  an  ambiguity.  Each  of  them  Is  given  an 
additional  argument,  to  help  resolve  ambiguities.  This  argument 
appears  after  the  standard  semantic  filters,  PLANNER  relation, 
and  semantic  markers  for  the  relation.  It  Is  composed  of  two 
elements  --  a  LISP  form  to  be  evaluated  for  a  "plausibility", 
and  a  paraphrase  of  the  meaning.  In  this  example,  the 
plausibilities  are  simply  numbers.  They  could  just  as  easily  be 
a  form  like  (PLAUSCHECK) ,  which  would  call  a  special  program 
(defined  by  the  user)  which  could  do  arbitrary  calculations  In 
order  to  decide  on  a  plausibility.  This  might  Involve  PLANNER 
deductions,  checks  on  the  exact  syntax  of  the  sentence,  or  other 
schemes  such  as  looking  through  a  network  or  other  model  In 
order  to  decide  which  meaning  fits  best  with  the  the  other  words 
In  the  sentence  and  the  subject  being  discussed. 

As  a  semantic  structure  Is  built.  It  takes  on  the  sum  of 
the  plausibilities  of  Its  components,  as  Its  own  plausibility. 

No  "pruning"  Is  done  to  eliminate  obvious  low  plausibilities, 
although  this  would  be  a  simple  addition  to  the  program.  All 
Interpretations  are  carried  along  until  the  sentence  Is  totally 
parsed. 

If  the  sentence  Is  a  command,  the  system  tries  to  cai’ry  out 
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the  most  plausible  Interpretation.  If  that  falls#  It  tries  the 
next#  ::n'i  so  on  until  one  succeeds  or  a  total  failure  causes  the 
system  to  respond  "I  CAN'T".  Questions  are  handled  more 
completely.  The  system  orders  the  Interpretations  by 
plausibility  and  finds  the  answer  for  the  most  plausible.  It. 
then  tries  again  to  answer  It#  using  only  Information  mentioned 
In  the  previous  sentence  and  Its  answer.  If  It  succeeds  In 
finding  the  same  answer#  It  deducts  500  from  the  plausibility# 
since  It  Is  unusual  to  ask  a  question  to  which  the  answer  was 
just  given#  either  explicitly  or  Implicitly.  If  the  Information 
In  the  previous  sentence  Is  not  sufficient  to  answer  It#  the 
system  then  tries  to  answer  using  only  Information  which  has 
been  mentioned  previously  in  the  discourse.  If  this  succeeds  It 
deducts  200.  If  the  plausibility  Is  higher  than  that  of  the 
next  interpretation  by  a  large  enough  margin  (a  factor  set  by 
the  user  and  called  TIMID)  It  gives  the  answer  as  found.  If 
not#  It  saves  the  answer  and  repeats  the  process  for  the  next 
Interpretation.  After  all  Interpretations  have  been  processed# 
the  answers  are  checked  to  see  If  they  are  Identical.  In  this 
case  It  doesn't  matter  which  Interpretation  Is  Intended#  and  the 
system  simply  gives  the  answer.  Finally#  If  there  are  differing 
answers#  the  user  must  be  asked  what  he  meant.  Associated  with 
each  Interpretation  Is  a  list  of  those  places  where  It  differed 
from  others.  This  Is  produced  automatically  by  each  program 
which  accepts  multiple  definitions  (such  as  NMEANS  and  CMEANS). 
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Each  difference  Is  marked  by  two  atoms  --  one  whose  properties 
Indicate  the  place  In  the  sentence  where  the  ambiguity  was 
produced/  and  the  other  Indicating  the  meaning  selected.  In 
sorting  out  the  ambiguities,  the  system  looks  for  two  such 
structures  with  the  same  first  atom  but  different  second  ones. 
The  first  can  be  used  to  decide  what  phrase  Is  questionable, 
while  the  second  atoms  carry  the  paraphrases.  Special  care  Is 
taken  to  make  sure  that  the  same  sentence  Interpretation  does 
not  Involve  two  different  Interpretations  of  a  single  element 
(like  ''It"). 

Faced  with  an  unresolvable  ambiguity,  the  system  looks 
through  the  list  of  Interpretations  for  a  conflict,  then 
generates  a  response  like: 

I’M  NOT  SURE  WHAT  YOU  MEAN  BY  "ON  TOP  OF"  IN  THE  PHRASE  "ON 
TOP  OF  GREEN  CUBES  ". 

DO  YOU  MEAN; 

1  -  DIRECTLY  ON  THE  SURFACE 

2  -  ANWHERE  ON  TOP  OF  ? 

The  response  (a  typed  number)  Indicates  which  Is  meant,  and 
all  Interpretations  which  Involve  the  other  meanings  (there  can 
be  more  than  2  associated  with  a  single  ambiguity)  are 
eliminated.  If  there  are  still  conflicting  Interpretations, 
another  ambiguity  Is  selected  and  the  process  Is  repeated  until 
all  those  which  have  not  been  eliminated  give  the  same  answer, 
and  It  can  be  used  as  a  response. 

For  Interpreting  statements,  much  more  subtlety  Is  needed. 
In  general  the  choice  between  Interpretations  depends  on  how 
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"reasonable"  an  Interpretation  Is.  if  we  see  "The  book  fell  off 
the  table  because  It  was  slanted.",  we  avoid  believing  that  "It" 
refers  to  the  book,  or  that  It  was  "slanted"  by  the  author’s 
bias.  It  would  not  be  reasonable  for  either  of  these  to  be  the 
reason  for  falling  off  of  a  table. 

Parc  of  the  Interpreter  which  looks  at  declarative 
statements  must  be  able  to  evaluate  how  "surprising"  an 
Interpretation  Is,  and  to  choose  the  one  which  fits  best  Into 
the  normal  expectations.  Including  the  hearer’s  knov/ledge  about 
the  world.  This  Is  not  easy  to  formalize,  and  was  not  attempted 
In  this  project.  It  could  be  extended  to  lower  levels  to  check 
sub-units  for  reasonableness  as  they  are  formed,  to  avoid 
carrying  multiple  Interpretations  through  the  analysis  of  the 
entire  sentence.  It  seems  clear  that  people  do  this,  and  a 
truly  Intelligent  language-understanding  program  must  do  so  as 


wel  1 
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4,2.11  To  Pg  and  To  Have 

The  verbs  "be"  and  "have"  are  two  of  the  most  common  wotds 
in  F.np.llsh,  and  have  a  complex  variety  of  uses.  They  appear  In 
the  system  In  two  v/ays.  First,  In  the  grammar  they  are  treated 
specially  since  they  can  occur  as  auxlll’ary  verbs,  as  In  "1 
would  have  been  going".  In  this  use,  they  do  not  add  any 
semantic  information  except  for  helping  to  determine  features  of 
the  VG,  such  as  its  tense.  Their  other  use  Is  as  main  verbs  In 
clauses  like  "Do  you  have  a  match?"  and  "He  is  wrong."  As  a  main 
verb,  "be"  is  handled  specially  In  the  grammar  since  It  can 
enter  Into  constructions  such  as  "there  Is"  which  do  not  appear 
with  any  other  verb.  However,  the  semantic  analyzer  does  not 
know  anything  special  about  "be"  and  "have".  Their  meaning  Is 
Included  In  their  definitions,  which  are  called  as  programs  just 
like  any  other  verb  definitions. 

A.  Be 

The  use  of  "be"  depends  on  the  specific  constellation  of 
objects  and  complements  In  tite  clause.  The  definition  Is  a 
program  containing  about  40  lines  of  LISP,  which  handles  those 
meanings  relevant  to  the  BLOCKS  world  (for  example.  It  cannot 
deal  with  a  "role-playing"  meaning,  like  "Laurence  Olivier  was 
Hamlet.") 

Sentences  with  the  feature  THERE,  Involving  a  constiuctlon 
like  "there  is"  are  represented  by  the  PLANNER  expression 
(#EXISTS  #1  *TIME).  This  attaches  the  correct  time,  and  might 
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be  Important  for  objects  which  can  be  created  and  destroyed,  as 
In  "Was  there  a  stack...?". 

The  other  meanings  of  "be"  Involve  INTenslve  clauses  which 
contain  an  object  and  a  complement.  One  definition  checks  for 
COMPQ  questions  like  "What  color  Is  the  block?",  to  generate  a 
PLANNER  expression  (#C0L0R  tBLOCK  $?X1).  If  the  complement  Is  a 
definite  NG,  as  In  "Is  the  green  block  the  b 1 gges t  obi ect?"  or 
"What  Is  the  biggest  obj ect?",  the  referent  will  have  already 
been  determined,  and  Is  Inserted  In  a  PLANNER  expression 
(THAMONG  ***  (0,U0Tc(  tOBJ)) ),  where  :OBJ  Is  the  referent.  This 
can  function  In  two  ways.  If  the  subject  Is  also  definite,  as 
In  the  first  example,  the  ***  will  be  replaced  by  Its  referent, 
and  the  statement  will  succeed  only  If  the  two  are  Identical. 

If  the  subject  Is  Indefinite,  the  THAMONG  statement  will  cause 
It  tj  be  assigned  to  the  same  referent  as  the  complement. 

If  the  complement  Is  a  PREPG  or  a  complex  AOJG,  like 
"bigger  than  a  breadbox",  "be"  Is  only  serving  as  a  place-holder 
which  can  accept  a  time  reference.  The  semantic  Interpreter  In 
dealing  with  a  phrase  like  "on  the  table"  In  "Is  the  block  on 
the  table?"  has  already  set  up  a  relation  of  the  form  (fON 
:BL0CK  :TABLE)  which  Includes  the  appropriate  time  reference. 

In  this  case,  the  "be"  program  simply  takes  the  RSS  produced  for 
the  complement,  and  uses  It  as  the  semantic  Interpretation  of 
the  clause. 

The  other  possibilities  for  the  complement  are  an 
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Indefinite  NG,  a  simple  ADJ6  (e.g.  a  single  adjective)/  or  a  new 
word.  In  the  case  of  a  NG,  the  complement  NG  contains 
additional  information  to  be  ascribed  to  the  subject,  as  In  "a 
large  object  v/hich  Is  a  red  b  1  ock” .  The  PLANNER  description  of 
the  complement  Is  stripped  from  its  OSS,  and  appended  to  the 
PLANNER  description  of  the  subject.  If  the  subject  is  definite, 
as  In  "Is  the  bigger  t  th I ng  a  red  block?",  the  referent  Is 
known,  and  can  be  plugged  Into  the  PLANNER  description  of  the 
complement  to  see  If  the  description  applies.  This  is  done 
using  a  pseudo-concept  called  #HASPROP  which  triggers  the 
mechanisms  In  the  semantic  Interpreter. 

If  the  compltjment  Is  a  simple  ADJG,  the  ADJG  semantic 
specialist  creates  Its  OSS  by  taking  the  OSS  for  the  subject, 
stripping  away  the  PLANNER  description,  and  using  the  rest  as  a 
skeleton  on  which  to  place  the  PLANNER  expression  produced  by 
the  adjective.  Once  this  Is  done.  It  can  be  treated  exactly 
like  an  Indefinite  NG. 

Finally,  If  the  subject  or  complement  Is  a  new  word  (as  in 
"A  frob  Is  a  big  red  cube."  or  "A  big  red  cube  is  a  frob.")  a 
new  definition  Is  created  using  the  function  #0EFINE.  The 
definition  must  be  In  the  form  of  an  Indefinite  MG,  and  the  new 
word  Is  as'-sumed  to  be  a  noun.  The  semantic  definition  created 
for  the  noun  contains  the  OSS  which  was  created  for  the  defining 
NG,  and  sets  this  OSS  up  as  the  meaning  of  the  noun  when  it  Is 


used 
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B.  Have 

The  definition  of  "have"  Is  also  used  to  handle  the 
possessive.  For  the  limited  subject  natter  (and  for  much  of 
English)  this  Is  a  good  approximation.  There  are  cases  where  It 
does  not  apply  --  "the  painting  which  John  has"  Is  not 
necessarily  the  same  as  "John's  painting."  The  preposition  "of" 
also  makes  use  of  the  same  definition.  A  more  complete 
treatment  would  distinguish  between  the  three,  and  this  would 
Involve  only  simple  changes  to  the  semantic  programs. 

The  Interesting  thing  about  "have"  Is  that  It  Is  not  used 
to  Indicate  a  few  different  relationships,  but  Is  a  place-marker 
used  to  create  relationships  dependent  on  the  semantic  types  of 
the  objects  Involved.  "Sam  has  a  mother."  can  be  represented 
(#M0THER-0F  X  SAM),  "Sam  has  a  friend."  Is  (#FRIEND  X  SAM),  "Sam 
has  a  car."  Is  (#0V-IM  SAM  CAR),  "Sam  has  support."  Is  (fSUPPORT  X 
SAM),  "Sam  has  a  hand."  Is  (#PART  SAM  HAND),  etc.  The 
definition  of  "have"  (or  the  possessive,  or  "of")  does  not 
Include  within  Itself  all  of  these  different  relations.  A  few 
Interpretations  (like  have-as-part,  owning,  or  having  In 
physical  possession)  can  be  reasonably  considered  distinct 
meanings  of  "have",  and  are  Included  In  Its  definition.  The 
others,  such  as  "mother"  and  "support"  really  are  determined  by 
the  subject  and  object.  Some  systems  use  this  fact  to  find  the 
meaning  of  special  phrases  like  "client's  lawyer"  without  doing 
syntactic  analysis  (see  section  2.5).  Our  system  uses  a 
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different  method/  allowing  a  word  to  be  defined  as  a  #R0LE. 
"Mother”  might  be  defined  as: 

(NMEANS((#PERSON  #R0LE) 

((#PERS0N  ***) 

(#M0THER-0F  ***  ?) 

(#R0LE((#PERS0N))(#M0THER-0F  #1  #2))) 

There  are  two  new  tilings  In  this  definition.  First/  the 
semantic  marker  ?R0LE  Is  added  to  Indicate  the  type  of 
•definition.  Second/  a  role  definition  Is  Included.  It  contains 
a  semantic  filter  for  objects  which  can  be  used  In  the  relation 
(In  this  case  those  which  cculd  have  a  rrother)/  and  a  PLANNER 
statement  indicating  the  relation  (In  the  same  syntax  used  by 
CMEANS).  If  the  word  "mother"  Is  used  In  a  phrase  like  "Carol's 
mother"  or  "Carol  has  a  mother"  or  "the  mother  of  Carol"/  the 
system  will  Insert  the  right  OSS  to  produce  the  PLANNER 
description  (#M0THER“0F  $?X1  CAROL)  If  "mother"  appears  In  any 
other  form/  the  OSS  will  contain  (#M0THER-0F  $?X1  ?)  which  will 
be  satisfied  In  a  PLANNER  goal  If  XI  Is  the  mother  of  anyone  at 
all . 

Through  the  #R0LE  mechanism/  arbitrary  relationships  can  be 
expressed  with  "have"  (or  "of"/  or  possess  Ives)  without  bloating 
Its  definition.  There  could  be  more  than  one  #R0LE  assigned  to 
a  word  as  well.  For  example  "painting"  would  Involve  different 
roles  for  "Rembrandt's  painting"  "George  Washington's  painting 
by  Stuart"/  "the  Modern  Museum's  painting."/  etc. 
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4^2.12  AddLtJQ.na]  Semantic  Information 
A.  Using  Clauses  as  Objects 

In  order  to  Interpret  a  sentence  like  "Find  a  block  which 
Is  taller  than  the  one  I  told  you  to  pick  up."  the  system  must  ♦ 

use  a  clause  ("you  to  pick  up")  as  the  object  of  a  verb 
("tell").  It  generates  a  pseudo-object  of  the  type  fEVENT#  and 
creates  an  OSS  for  that  object.  In  the  example  mentioned/  the 
clause  "you  to  pick  up"  would  have  produced  the  RSS: 


(fPICKUP  NGl 

((PAST)  NIL  NIL 

NIL)) 

MIL) 

(0)  ) 

rel  PLANNER  expression 

neg 

markers 

Figure  71 

—  RSS  for  "you 

to  pick 

up" 

NGl  is  an  OSS  describing  the  object  "the  one"/  which  the 
system  has  set  up  as  the  object  of  the  clause/  and  has 
Interpreted  as  "block".  The-  program  SMCL4  takes  this  structure 
and  produces  a  corresponding  OSS: 

(  (  ((EVXl)  0  (THGOAL  (#PICKUP  STEVXl  $?X1  ((PAST)  NIL  NIL  NIL)) 

(THUSE  TCTE-PICKUP))) 

(0  #EVENT  iTHING) 

(#THING)) 

EVXl 

(1  INOEF  NIL) 

NIL) 

Figure  72  --  OSS  for  "you  to  pick  up" 

A  variable  was  generated  for  the  event/  of  the  form  EVXn, 
and  a  new  PLANNER  expression  for  the  event  was  generated/ 
Including  the  event  name  as  the  second  element.  The  reason  for 
putting  It  second  Is  technical/  and  should  be  changed  someday 
for  programmer  convenience  and  consistency  with  the  scheme 
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described  In  Section  2.1.  In  the  expression#  the  name  of  the 
OSS  is  replaced  with  its  associated  variable  (In  this  case  $?X1) 
since  the  new  structure  will  be  used  as  part  of  the  description 
of  that  object.  The  recommendation  list  includes  the  theorem 
which  is  designed  to  deal  with  expressions  Involving  time  and 
event-names#  and  is  put  In  by  the  system.  In  working  with  the 
rest  of  the  sentence#  this  resultant  OSS  can  be  used  just  like 
any  other  OSS#  as  an  object  of  a  verb#  preposition#  etc. 

When  PLANNER  evaluates  the  expression#  it  may  have  the 
event  already  stored  away#  or  it  may  have  to  deduct'  that  It 
happened  by  looking  at  other  events.  This  is  handled  by  the 
theorem  TCTE-PICKUP#  and  the  name  of  the  resultant  event  Is  the 
value  which  Is  assigned  to  tne  variable  EVXl. 

B.  Types  of  Modification 

There  are  a  variety  of  ways  In  which  a  modifier  can  affect 
the  meaning  of  the  phrase  or  clause  It  modifies.  Since  the 
definition  is  a  program#  the  user  has  great  freedom  to  use 
different  types  of  modification.  A  time  modifier  like  "now"  or 
"then"  will  modify  the  Time  Semantic  Structure  associated  with 
the  clause#  an  adverb  like  "quickly"  may  set  up  a  new  relation 
such  as  (#SPEED  $?EV1  #FAST)  using  the  name  of  the  event#  while 
others  may  make  changes  directly  to  the  relation  being 
constructed.  The  semantic  structures  previously  built  can  be 
analyzed  and  modified  by  an  arbitrary  function  which  suits  the 
meaning  of  the  modifier.  One  special  facility  exists  for  making 
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substitutions  within  an  expression.  If  the  PLANNER  expression 
of  a  CMEANS  or  NMEANS  definition  Is  of  the  form  (#SUBST  al  a2  bl 
b2...},  the  effect  will  be  to  modify  the  existing  semantic 
structure  by  substituting  the  atom  a2  for  al/  b2  for  bl/  etc. 

No  new  expression  Is  added  to  the  PLANNER  description.  The  word 
’•move"  might  be  defined  using: 

(CMEANS((((#ANIMATE))((#MANIP)))  (#PUT  #2  LOG  *TIME>  (#M0VE))) 
This  Indicates  that  moving  Is  done  by  an  animate  object  to  a 
manipulable  object/  and  Involves  putting  It  at  the  place  "LOG”. 
The  atom  LOG  would  be  given  a  OSS  Indicating  an  unknown  place. 
The  resulting  RSS  has  the  semantic  marker  #MOVE.  The  sentence 
"Move  a  block."  would  create  a  goal  (#PUT  NGl  LOG)/  where  NGl  Is 
a  description  of  "a  block".  The  theorem  for  #PUT  could  then 
choose  a  block  and  place.  If  the  sentence  Is  "Move  a  block  Into 
the  box."/  the  final  result  should  be  (#PUTIM  NGl  {BOX).  The 
modifying  phrase  makes  a  major  change  In  the  Internal 
representation  of  the  meaning. 

This  change  can  be  done  by  defining  "Into"  to  Include  among 
Its  meanings: 

(GMEANS((((#MOVE))((#BOX)))  (#SUBST  #PUTIN  #MOVE  #2  LOG)  NIL)) 
If  a  PREPG  with  the  preposition  "Into"  modifies  a  clause  with 
the  semantic  marker  #MOVE/  and  the  object  of  the  preposition  has 
the  marker  #BOX,  then  the  definition  applies.  The  RSS  for  the 
clause  Is  changed  by  substituting  #PUTIN  for  #MOVE/  and  the 
object  of  the  preposition  for  #L0C.  The  special  symbols  *1,  #2/ 
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#3,  ***,  and  *TIME  are  treated  as  they  would  be  In  a  normal 
CMEANS  or  NMEANS  definition,  being  replaced  by  the  appropriate 
object. 

C.  Using  Evaluation  In  CMEANS  and  NMEANS 

Although  every  definition  has  the  power  to  use  programs, 
definitions  using  the  standard  forms  CMEANS  and  NMEANS  are 
forced  Into  a  rather  rigid  syntax  which  does  not  have  a 
procedural  character.  To  avoid  this,  there  Is  an  extra  levelof 
evaluation.  If  the  PLANNER  portion  of  a  definition  Is  of  the 
form  (#EVAL  s)  where  s  Is  any  LISP  atom  or  s-express Ion,  the 
form  will  be  EVALled  before  the  description  Is  used  In  the 
definition,  and  Its  value  used  Instead.  This  value  will  undergo 
the  usual  substitutions  for  #1,  #2,  *TIME,  etc.  This  feature  Is 
of  particular  use  In  capturing  the  semantic  regularities  of  the 
language  by  using  auxllllary  functions  In  defining  words.  For 
example,  color  adjectives  like  "red*'  and  "blue"  share  most  of 
their  characteristics.  They  apply  to  physical  objects.  Involve 
a  relation  with  #COLOR,  etc.  Rather  than  define  them 
separately,  we  would  like  a  single  function  #COLOR  which  needs 
only  to  have  the  exact  color  specified.  The  dictionary 
definition  of  blue  would  then  be  (#COLOR  #BLUE) .  The  function 
#COLOR  can  be  defined  In  LISP: 

(DdFUN  #C0L0R  FEXPR  (A) 

(NMEANS((#PHYS08)  (#EVAL  (LIST(LIST  (QUOTE  #COLOR) 

(QUOTE  ***) 

(CAR  A))))))) 
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When  (#COLOR  #BLUE)  Is  evaluated,  the  #EVAL  will  produce 
the  form  ((#COLOR  ***  iBLUE)),  which  will  then  be  used  by  NMEANS 
In  the  usual  way. 

As  another  example,  the  word  "grasp”  can  be  used  to  mean 
#<5RASPING  (an  object  being  held)  or  #GRASP  (the  action  of 
closing  the  fingers  around  It).  The  difference  depends  on 
whether  the  VG  Is  progressive  or  not.  The  function 
(PROGRESSIVE)  finds  out  whether  the  clause  Is  progressive,  by 
looking  at  the  verb  and  the  tense.  The  definition  of  "grasp" 
can  be: 

( CMEANS ( ( ( ( »AN I MATE) ) ( ( #MAN IP))) 

(#EVAL  (COND  ((PROGRESSIVE)  (QU0TE(#3RASPIMG  #2  •TIME))) 

(T  (QUOTE  (#GRASP  #2  •TIME)))))  NIL)) 

D.  Some  Interesting  Problems 

There  are  many  areas  In  which  the  semantic  analysis  needs 

to  be  refined  and  expanded.  The  system  does  not  pretend  to 

contain  a  complete  analysis  of  English,  but  Is  rather  an 

Illustration  of  how  many  aspects  of  semantics  could  be  handled. 

This  section  describes  a  few  places  where  modification  might 

begin. 

1.  Definite  Determiners 

in  our  system,  a  definite  noun  phrase  Is  Interpreted  as 
referring  to  a  unique  object  or  set  of  objects  known  to  the 
hearer.  In  more  general  language  use,  def Ini tenv%ss  Is  often 
used  to  convey  new  Information.  The  phrase  "my  brother  who 
lives  In  Chicago"  can  be  said  to  someone  who  is  not  aware  I  have 
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a  brother,  and  the  effect  Is  to  Inform  him  that  Indeed  I  do,  and 
to  tell  him  where  this  brother  lives.  Other  nouns  can  describe 
"functions",  so  that  "the  title  of  his  new  book",  or  "my 
address",  are  allowable  even  If  the  hearer  has  not  heard  the 
title  or  address,  since  he  knows  that  every  book  has  a  unique 
title,  and  every  person  an  address.  Superlative  phrases  like 
"the  tallest  elephant  In  Indiana"  also  refer  to  a  unique  object, 
even  though  the  hearer  may  not  have  seen  or  heard  of  this  object 
before. 

Cases  such  as  these  can  lead  to  problems  of  referential 
opacity.  If  your  name  Is  "Seymour",  and  I  say  "Excuse  me,  I've 
never  heard  your  name.".  It  does  not  Imply  th=^t  I  have  never 
heard  the  name  Seymour.  The  sentence  "I  want  to  own  the  fastest 
car  In  the  world."  does  not  have  the  same  meaning  If  we  replace 
the  NG  with  Its  current  referent  --  I  don't  want  whichever  car 
It  Is  that  happens  to  be  fastest  right  now. 

These  and  other  such  problems  need  to  be  handled  in  the 
programs  for  Interpreting  a  definite  NG,  using  syntactic, 
semantic,  and  world  knowledge. 

2.  Verb  Tenses 

The  current  system  Implements  only  a  few  of  the  possible 
tenses  —  PRESENT,  PAST,  PRESENT  IN  PRESENT,  PRESENT  IN  PAST, 
PAST  IN  PAST,  and  an  elementary  form  of  the  MODAL  "can". 

Section  4.2.8  described  some  of  the  problems  which  can  be 
Involved  In  time  reference,  and  a  deeper  anlaysls  Is  needed  to 
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account  for  Interactions  between  the  order  of  phrases  and  the 
possibilities  for  time  reference.  The  modalS/  conditionals# 
subjunctives#  etc.  need  to  be  handled.  This  may  demand  a 
version  of  PLANNER  which  can  temporarily  move  Into  a 
hypothetical  world#  or  which  has  more  power  to  analyze  1 own 
theorems  to  ansv/er  questions  Involving  models  like  "can"  and 
"must". 

3.  Conjunction 

Only  the  most  elementary  problems  In  conjunction  have  beer, 
dealt  with  In  the  current  system.  Many  conjoined  structures  do, 
not  yet  have  semantic  analyzer  programs#  end  no  subtlety  Is  used 
In  deciding  on  the  meaning  of  words  like  "and".  "And"  can  be 
used  to  Indicate  temporal  sequence  ("We  went  to  the  circus  and 
came  home.")  causality  ("We  saw  him  and  understood.")#  as  a  type 
of  conditional  ("Do  that  again  and  I'll  clobber  you!")#  In 
specification  of  how  to  do  something  ("Be  a  friend  and  help 
me.")#  etc.  Understanding  these  uses  will  be  related  to  the 
discourse  problem  of  the  ordering  of  sentences.  For  example# 
"The  light  Is  on.  He's  there."  Indicates  a  chain  of  reasoning. 

In  addition#  no  attempt  has  been  made  to  disambiguate 
nested  structures  like  "A  and  B  or  C"#  or  "the  old  men  and 
women."  Syntactic  criteria  are  not; sufficient  for  these 
distinctions#  and  a  powerful  semantic  program  will  have  to  be 
used  to  ask  "which  Interpretation  makes  more  sense  In  this 

II 


case 
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4.  Non-syntactic  Relations 

There  are  some  piaces  In  English  where  the  relation  between 
a  set  of  words  is  not  indicated  by  syntactic  clues,  but  Is 
largely  based  on  semantics.  One  example  Is  chain  of  classifiers 
before  a  noun.  In  "strict  gun  law",  the  law  is  strict,  but  in 
"stolen  gun  law",  the  gun  Is  stolen.  It  Is  possible  to  combine 
long  strings  like  "a  helical  aluminum  soup  pot  cover  adjustment 
screw  clearance  sale".  In  which  a  large  amount  of  semantic 
Information  must  be  combined  with  the  ordering  to  find  the 
correct  interpretation.  The  current  system  handles  classifiers 
by  assuming  that  they  all  separately  modify  the  head.  This 
needs  to  be  changed,  to  use  both  the  semantic  markers,  and 
complex  deductions  to  find  the  real  relationships. 
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4,3  The  Semantics  of  Discourse 

In  section  4.1.  we  discussed  the  different  types  of  context 
which  can  affect  the  way  a  sentence  Is  Interpreted,  In  this 
section  we  v/lll  describe  the  specific  mechanisms  used  by  our 
program  to  Include  context  In  Its  Interpretation  of  language. 

We  have  concentrated  on  the  "local  discourse  context",  and  the 
ways  In  which  parts  ov  the  meaning  of  a  sentence  can  be  referred 
to  by  elements  of  the  next  sentence.  For  example,  pronouns  like 
"It"  and  "they"  can  refer  to  objects  which  have  been  previously 
mentioned  or  to  an  entire  event,  as  In  "Why  did  you  do  I t?" . 

The  words  "then"  and  "there"  refer  back  to  a  previous  time  and 
place,  and  words  like  "that"  can  be  used  to  mean  "the  one  most 
recently  mentioned",  as  In  "Explain  that  sentence." 

In  addition  to  referring  back  to  a  particular  object,  we 
can  refer  back  to  a  description  In  order  to  avoid  repeating  it. 
We  can  say:  "Is  there  a  small  grey  elephant  from  Zanzibar  next 
to  a  big  one?" .  Sometimes  Instead  of  using  "one"  to  avoid 
repetition,  we  simply  omit  part  of  a  phrase  or  sentence.  We  can 
reply  to  "Would  you  like  a  corned-beef  sandwich?"  with  "Bring  me 
or  we  can  respond  to  almost  anything  with  "Why?"  In  these 
examples,  the  second  sentence  Includes  by  Implication  a  part  of 
the  first. 

These  are  not  really  discourse  features,  since  they  can 
appear  just  as  well  In  a  single  sentence.  In  fact,  there  are 
some  sentences  which  would  be  almost  Impossible  to  express 
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without  using  one  of  these  mechanisms/  such  as:  "Find  a  block 
which  Is  bigger  than  anything  which  supports  li."  These 
mechanisms  can  be  used  to  refer  back  to  anything  mentioned 
previously/  whether  In  an  earlier  sentence  of  the  speaker,  one 
of  the  replies  to  him,  or  something  occurring  earlier  In  the 


same  utterance 
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iul*l  Pronouns 

First  we  will  look  at  the  use  of  pronouns  to  refer  back  to 
objects.  Since  our  robot  does  not  know  any  people  other  than 
the  one  conversing  with  It,  It  has  no  trouble  with  the  pronouns 
"you"  and  "I"  which  always  refer  to  the  two  objects  :SHRDLU  and 
jFRIENO.  a  more  general  program  would  keep  track  of  who  was 
talking  to  the  computer  In  order  to  find  the  referent  of  "I". 

Vihen  the  NG  p.ogram  In  the  grammar  finds  a  NG  consisting  of 
a  pronoun,  it  calls  the  program  which  Is  the  definition  of  that 
pronoun.  The  definitions  of  "It"  and  "they"  use  a  special 
heuristic  program  called  SM1T,  which  looks  Into  the  discourse 
for  all  of  the  different  things  they  might  refer  to,  and  assigns 
a  plausibility  value  to  each  Interpretation.  If  more  than  one 
Is  possible,  they  are  carried  along  simultaneously  through  the 
rest  of  the  sentence,  and  the  ambiguity  mechanism  decides  at  the 
end  which  Is  better.  Including  the  last-resort  effort  of 
printing  out  a  message  asking  for  clarification.  If  SMIT  finds 
two  different  Interpretations,  and  one  Is  chosen  because  of  a 
higher  plausibility,  the  system  types  out  a  message  to  Inform  us 
of  the  assumptio:!  made  In  choosing  one  interpretation,  as  in 
Sentence  3  of  Section  1.3: 

BY  "IT",  I  ASSUME  YOU  MEAN  THE  BLOCK 
WHICH  IS  TALLER  THAN  THE  ONE  I  AM 
HOLDING. 

If  a  response  from  the  user  Is  needed,  the  request  Is  typed 
In  the  same  format  as  the  message  used  for  other  ambiguities,  as 
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In  sentence  24  of  Section  1.3.  In  the  case  of  sentence  3,  It 
would  be: 

I'M  NOT  SURE  WHAT  YOU  MEAN  BY  "IT"  IN 
THE  PHRASE  "PUT  IT  INTO  THE  BOX" 

DO  YOU  MEAN: 

1  -  THE  BLOCK  WHICH  IS  TALLER  THAN  THE 

ONE  I  AM  HOLDING 

2  -  THE  ONE  I  AM  HOLDING  ? 

A  simple  transformation  Is  used  to  switch  "you"  with  "1", 
and  make  the  corresponding  verb  changes/  and  the  words  are 
borrowed  directly  from  the  Input  sentences. 

In  our  discussion  of  pronounS/  we  will  use  "It"  as  typical. 
In  most  caseS/  "they"  (or  "them")  Is  treated  Identically  except 
checking  for  agreement  with  plural  rather  than  singular.  The 
pronouns  "he"  and  "she"  never  occur  In  our  limited  subject 
matter,  but  they  would  be  treated  exactly  like  "It",  except  that 
they  would  make  an  extra  check  to  see  that  their  referent  Is  In 
fact  animate  and  of  the  right  gender. 

The  first  thing  checked  by  SMIT  Is  whether  "It"  has  already 
appeared  In  the  same  sentence.  We  very  rarely  use  the  same 
pronoun  to  refer  to  two  different  objects  In  the  same  sentence, 
so  It  Is  generally  safe  to  adopt  the  same  Interpretation  we  did 
the  first  time.  If  there  were  several  possible  Interpretations, 
the  system  Is  careful  not  to  match  up  one  Interpretation  from 
one  occurrence  of  "It"  with  a  different  one  from  another 
occurrence  In  building  an  overall  Interpretation  of  the 


sentence 
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Similarly,  If  "It"  was  used  In  the  previous  sentence.  It  Is 
likely  that  If  used  again  It  will  refer  to  the  same  thing.  In 
either  of  these  cases,  SMIT  simply  adopts  the  previous 
Interpretation. 

Next,  a  pronoun  may  be  Inside  a  complex  syntactic 
construction  such  as  "a  block  which  Is  bigger  than  anything 
which  supports  JLE,."  English  uses  the  reflexive  pronouns,  like 
"Itself"  to  refer  back  to  an  object  In  the  same  sentence. 
However,  If  In  going  from  the  pronoun  to  the  referent  on  the 
parsing  tree.  It  Is  necessary  to  pass  through  another  NG  node, 
an  ordinary  pronoun  like  "It"  Is  used,  since  "Itself"  would 
refer  to  the  Intermediate  NG.  Notice  that  If  we  replaced  "It" 
by  "Itself"  Ip  our  sentence.  It  would  no  longer  refer  to  the 
block,  but  to  "anything". 

SMIT  looks  for  this  case  and  other  related  ones.  Vlhen  such 
a  situation  exists,  the  program  must  work  differently. 
Ordinarily,  when  we  refer  to  "It"  we  have  already  finished 
finding  the  referent  of  the  MG  being  referred  back  to,  and  "It" 
can  adopt  this  referent.  In  this  case,  we  have  a  circle,  where 
"It"  Is  part  of  the  def In  1 1 Ion  of  the  object  It  Is  referring  to. 
The  part  of  the  program  which  does  variable  binding  In  relating 
objects  and  clauses  Is  able  to  recognize  this,  and  treat  It 
correctly  by  using  the  same  variable  for  "a  block"  and  "It". 

The  pronoun  may  also  refer  to  an  object  In  an  embedded 
clause  appearing  earlier  In  the  same  clause,  as  In  "Before  you 
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pick  up  the  red  cube/  clear  JLi  off.*'  SMIT  looku  through  the 
sentence  for  objects  In  such  acceptable  places  to  which  "It" 
might  refer.  If  It  doesn't  find  them  there/  It  begins  to  look 
at  the  previous  sentence.  The  pronoun  may  refer  to  any  object 
In  the  sentence/  and  the  meaning  will  often  determine  which  It 
Is  (as  In  our  example  about  the  demonstrators  In  the  Preface). 

We  therefore  cannot  eliminate  any  of  the  possibilities  on 
syntactic  grounds/  but  can  only  give  them  different  ratings  of 
"plausibility".  For  example/  In  Section  4.2.5  we  discussed  the 
Importance  of  a  "focus"  element  In  a  clause.  "It"  Is  more  likely 
to  refer  to  the  previous  focus  than  to  other  elements  of  the 
clause.  Similarly,  the  subject  Is  a  more  likely  candidate  than 
an  object/  and  both  are  more  likely  than  a  MG  appearing  embedded 
In  a  PREPG  or  a  secondary  clause. 

The  system  keeps  a  list  of  all  of  the  objects  referred  to 
in  the  previous  sentence/  as  well  as  the  entire  parsing  tree. 

By  using  PROGRAMMAR'S  functions  for  exploring  a  parsing  tree/ 
SMlT  Is  able  to  find  the  syntactic  positon  of  all  the  possible 
references  and  to  assign  each  a  plausibility,  using  a  fairly 
arbitrary  but  hopefully  useful  set  of  values  (for  example  we  add 
200  for  the  focus  element  beyond  what  It  would  normally  have  for 
Its  position  as  subject  or  object).  In  order  to  keep  the  list 
of  the  objects  In  the  last  sentence,  our  semantic  system  has  to 
do  a  certain  amount  of  extra  work.  If  we  ask  the  question:  "Is 
any  block  supported  by  three  pyramids?",  the  PLANNER  expression 
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produced  is: 

(THFIND  ALL  $?X1  (XI) 

(THGOAL(#IS  $?X1  #BL0CK)) 

(THFIND  3  J!?X2  (X2) 

(THGOAL(#IS  $?X2  ifpYRAMIO)) 

(THG0AL(#SUPP0RT  $?X2  $?X1)))) 

Once  this  Is  evaluated.  It  returns  a  list  of  all  tne  blocks 

satisfying  the  description,  but  no  record  of  what  pyramids 

supported  then.  If  the  next  sentence  asked  "Are  they  tall?",  we 

would  have  no  objects  for  "they"  to  refer  to.  Special 

Instructions  are  Inserted  Into  our  PLANNER  descriptions  which 

cause  lists  like  this  to  be  saved.  The  actual  PLANNER 

expression  produced  would  be: 

(THPUTPROP  (aUOTE  XI) 

(THFIND  ALL  $?X1  (XI) 

(THGOAL(i;S  $?X1  #BLOCK)) 

(THPUTPROP  (QUOTE  X2) 

(THFIND  3  $7X2  (X2) 

(THGOAL(#IS  S7X2  #PYRAMID)) 
(THGOAL(#SUPPORT  $7X2  $7X1))) 
(QUOTE  BIND))) 

(QUOTE  BIND)) 

This  only  occurs  when  the  system  is  handling  discourse. 

Finally,  "It"  can  be  used  In  a  phrase  like  "Do  rtl"  to 
refer  to  the  entire  main  event  of  the  last  sentence.  This 
LASTEVENT  Is  saved,  and  SMIT  can  use  It  to  replace  the  entire 
meaning  of  "do  It"  with  the  description  generated  earlier  for 
the  event. 

When  "that"  Is  used  In  a  phrase  like  "do  that".  It  Is 
handled  In  a  similar  way,  but  with  an  Interesting  difference. 

If  we  have  the  sequence  "Why  did  you  pick  up  the  ball7"  "To 
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build  a  stack.”  "Hov;  did  you  do  It?”/  the  phrase  "do  It”  refers 
to  "Pick  up  a  ball”.  But  If  we  had  asked  "How  did  you  do 
that?”/  It  would  refer  to  building  a  stack.  The  heuristic  Is 
that  "that”  refers  to  the  event  most  recently  mentioned  by 
anyone/  while  "It”  refers  to  the  event  most  recently  mentioned 
by  the  speaker. 

In  addition  to  remembering  the  participants  and  main  event 
of  the  previous  sentence/  the  system  also  remembers  those  In  Its 
own  responses  so  that  It  can  use  them  when  they  are  called  for 
by  pronouns.  It  also  remembers  the  last  time  reference/ 
(LASTIME)  so  the  word  "then”  can  refer  back  to  the  time  of  the 
previous  sentence. 

Special  uses  of  "It”  (as  In  "It  Is  raining.”)  are  not 
handled/  but  could  easily  be  added  as  further  possibilities  to 
the  SMIT  program. 
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iLiJ.i.1  g.P.bgt.U3U:.&.§  Jncompletes 

The  next  group  of  things  the  system  needs  to  Interpret 
Involves  the  use  of  substitute  nouns  like  "ore",  and  Incomplete 
noun  groups  like  "Buy  me  two . "  Here  we  cannot  look  back  for  a 
particular  object,  but  must  look  for  a  description.  SMIT  looks 
through  a  list  of  particular  objects  for  Its  meaning.  SMONE 
(the  program  used  for  "one")  looks  back  into  the  Input  sentence 
Instead,  to  recover  the  English  description.  "One"  can  be  used 
to  stand  for  part  or  all  of  that  description. 

As  with  "It",  "one"  can  refer  back  to  something  In  a 
previous  sentence,  the  previous  reply,  or  earlier  In  the  same 
sentence.  Here  though,  there  are  no  restrictions  about  where 
In  the  parsing  tree  the  description  can  lie.  "One"  depends  more 
on  surface  characteristics  than  on  structural  differences.  For 
example.  It  cannot  refer  back  to  a  NG  which  Is  a  pronoun  or  uses 
a  TPRON  like  "anything".  Our  program  for  "one"  Is  not  as 
complex  as  the  one  for  "It".  It  Is  primarily  based  on  the 
heuristic  of  "contrast".  People  often  use  "one"  to  contrast  two 
characteristics  of  basically  similar  objects,  for  example  "the 
big  red  block  and  the  little  one."  The  program  must  understand 
these  contrasts  to  Interpret  the  description  properly.  We 
realize  that  "the  lltle  one"  means  "the  little  red  block",  not 
"the  little  big  red  block"  or  "the  little  block".  In  order  to 
do  this,  our  system  has  as  part  of  Its  semantic  knowledge  a  list 
of  contrasting  adjectives.  This  Information  Is  used  not  only  to 
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decide  how  much  of  the  description  Is  to  be  borrowed  by  "one'*, 
but  also  to  decide  which  description  In  a  sentence  "one"  Is 
referring  to.  If  we  say  "The  green  block  supports  the  big 
pyramid  but  not  the  little  one."  It  Is  fairly  clear  that  "one" 
refers  to  "pyramid".  But  If  we  say  "The  big  block  supports  the 
green  pyramid  but  not  the  little  one.",  thun  "one"  might  refer 
to  "block".  The  only  difference  Is  the  change  of  adjectives  -- 
"big"  and  "little"  contrast,  but  "green"  and  "llttl*»"  do  not. 

Our  program  looks  for  such  contrasts,  and  If  It  finds  one.  It 
assumes  the  most  recent  contrasting  description  Is  the  referent. 
If  there  Is  no  contrast  between  the  phrase  being  analyzed  and 
any  NG  In  the  same  sentence,  previous  answer,  or  previous 
sentence.  It  then  looks  for  the  most  recent  NG  which  contains  a 
noun. 

It  Is  Interesting  to  note  that  SMONE  causes  the  system  to 
parse  some  of  Its  own  output.  In  order  to  use  the  fragment  of  a 
NG  It  finds,  SMONE  must  know  which  elements  It  can  use  (such  as 
noun,  adjective,  and  classifier)  and  which  It  does  not  (such  as 
number  and  determiner).  For  the  noun  groups  In  previous  Inputs, 
the  parsing  Is  available,  but  for  the  reply,  only  the  actual 
words  are  available  and  It  Is  necessary  to  construct  a  simple 
parsing  before  understanding  the  meaning  of  "one".  It  does  not 
call  the  entire  system  recursively  to  do  this,  but  uses  a 
simplified  version. 

An  Incomplete  NG,  containing  only  a  number  or  quantifier  Is 
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used  In  much  the  same  way  as  "one".  In  fact.  If  we  look  at  the 
series  "Buy  me  three."  "Buy  me  two."  "Buy  me  one.",  we  see 
they  are  nearly  Identical.  We  can  take  the  view  that  an 
Incomplete  MG  actually  has  an  Implied  substitute  noun  of  "one". 
This  Is  the  way  our  program  handles  Incomplete  noun  groups. 

Currently  the  set  of  contrasts  Is  stored  separately  as 
special  properties  In  the  dictionary  entries  of  the  adjectives 
Involved.  It  would  be  better  to  combine  this  with  the  semantic 
marker  systeu,  or  the  actual  system  of  PLANNER  programs  and 


concepts. 
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4.3.3  Qvcrjl  I  D.i^c.!;:;ui.SS  Context 

We  have  discussed  several  ways  of  using  overall  discourse 
context  in  understanding.  VJe  have  so  far  experimented  with  only 
one  of  these  --  keeping  track  of  what  has  been  mentioned  earlier 
in  the  discourse.  This  Is  not  the  same  as  looking  back  In  the 
previous  sentence  for  pronoun  references,  as  It  may  Involve 
objects  several  sentences  back  or  occurring  In  separate 
sentences.  If  there  are  many  blocks  are  on  the  table,  we  can 
have  a  conversation:  "What  is  In  the  box?"  "A  block  and  a 
pyramid."  "What  Is  behind  It?"  "A  red  block  and  another  box." 
"What  color  Is  the  box?"  "Green,"  "Pick  up  the  two  blocks." 

The  phrase  "the  two  blocks  Is  to  be  Interpreted  as  a 
particular  pair  of  blocks,  but  there  may  be  others  In  the  scene, 
and  nowhere  In  the  dialog  were  two  blocks  ment.Ioned  together. 

The  system  needs  a  way  to  keep  track  of  when  things  were 
mentioned.  In  order  to  Interpret  "the"  as  "the  most  recently 
mentioned"  in  cases  like  this. 

To  do  so,  we  use  PLANNEP.'S  facility  for  giving  properties 
to  assertions.  When  we  mention  a  "green  block",  the  semantic 
system  builds  a  PLANNER  description  which  Includes  the 
expressions: 

(THG0AL(#IS  $?«i  »BL0CK))  (THGOALC #C0L0R  $?X1  'GREEN)) 

After  the  sentence  containing  this  phrase  has  been  Interpreted, 
the  system  goes  back  to  the  PLANNER  descriptions  and  marks  all 
of  the  assertions  which  were  used,  by  putting  the  current 
sentence  number  or.  their  property  lists.  This  Is  also  done  for 
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the  assertions  used  In  generating  the  descriptions  of  objects  In 
the  answer. 

When  the  semantic  programs  find  a  definite  NG  like  "the  two 
red  blocks",  the  second  NG  specialist  (SMNG2)  uses  PLANNER  to 
make  a  list  of  all  of  the  objects  which  fit  the  description.  If 
there  are  the  right  number  for  the  NG,  these  are  listed  as  the 
"reference"  of  the  NG,  and  the  Interpretation  of  that  NG  Is 
finished.  If  there  are  fewer  than  called  for  by  the  determiners 
and  numbers,  SMNG2  makes  a  note  of  the  English  phrase  which  was 
used  to  build  the  description,  and  returns  a  message  to  the 
parser  that  something  has  gone  wrong. 

If  the  parser  manages  to  parse  the  sentence  differently, 
all  Is  well.  If  not,  the  system  assumes  that  the  NG 
Interpretation  was  the  reason  for  the  falluia,  and  the  system 
uses  the  stored  phrase  to  print  out  a  message  "I  don't 
understand  what  you  mean  by..." 

However,  If  there  are  too  many  objects  which  match  the 
description,  SMNG2  tries  to  find  out  which  were  mentioned  most 
recently.  It  does  this  by  using  PLANNER  to  recheck  the 
description  for  the  I  terns  It  found,  but  this  time  using  only 
those  assertions  mentioned  In  this  cr  the  previous  sentence. 

This  Is  easily  done  by  using  PLANNER'S  ability  to  put  a  "filter" 
on  the  assertions  to  be  retrieved  from  the  data  base,  allowing 
only  those  which  are  accepted  by  a  LISP  predicate  (which  In  this 
case  looks  for  a  sentence  number  In  the  appropriate  range  on  the 
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property  list  of  the  assertirii.  See  <Sussman>  for  details.)  If 
It  finds  the  right  lumber#  these  must  be  the  reference  of  the 
NG.  If  It  finds  too  few#  It  can  reiterate  the  procedure#  but 
using  all  of  the  assertions  mentioned  In  the  last  two  sentences. 
This  backward  progress  continues  until  at  some  point  either  It 
finds  the  right  number  or  the  number  found  Jumps  from  below  the 
right  number  to  above  It.  In  this  case  a  message  of  failure  Is 
returned  to  the  parser  as  before#  but  a  marker  Is  set  so  that  In 
case  the  sentence  cannot  be  understood#  the  message  returned  Is 
"I  don't  know  which...  you  mean"#  as  In  sentence  2  of  Section 
1.3: 

1  DON'T  UNDERSTAND  WHICH  PYRAMID  YOU  MEAN. 

Of  course#  a  more  sophisticated  program  Is  needed  for 
determining  the  referents  of  definite  NGs.  If  the  conversation 
Is  about  a  book#  the  NG  "the  title"  Is  clearly  acceptable  and 
must  refer  to  the  title  of  that  book#  even  though  no  title  was 
previously  mentioned.  If  there  Is  a  complicated  NG  like  "the 
biggest  cat  In  the  world"  or  "the  thing  I  want  to  do  most"#  the 
program  must  deduce  from  the  form  of  the  NG  that  there  Is  only 
one  such  object#  even  though  It  has  not  been  previously 
mentioned#  and  that  It  Is  being  defined  by  this  description. 
There  Is  a  large  area  for  expansion  In  dealing  with  these  and 
similar  problems. 

It  seems  likely  that  these  and  other  similar  aspects  of 
language  can  be  handled  systematically#  using  the  formalism  of 
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procedures  within  a  system  like  the  one  described  in  this  paper. 
The  semantic  and  deductive  systems  provide  a  framework  within 
which  we  can  formalize  .ind  combine  a  variety  of  aspects  of 
meaning  and  knowledge  to  deal  with  the  real  problems  of 
semantics . 
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4.4  Generation  of  Responses 

In  this  section  we  will  describe  how  our  language- 
understanding  system  generates  Its  linguistic  responses.  This 
aspect  was  net  emphasized  as  much  In  the  research  as  the  problem 
of  understanding  language,  and  we  have  not  developed  a  general 
theory  of  discourse  generation.  The  programs  produce  behavior 
which  seems  reasonable  for  carrying  on  a  dialog  about  the  robot 
world.  In  particular  we  dealt  with  four  different  aspects  of 
dlsco’irse  production  --  patterned  responses,  appropriate  answers 
to  questions,  generating  descriptions  of  objects  and  events,  and 
the  production  of  fluent  discourse. 

**■4.1  PatterCL^d  Responses 

The  easiest  way  to  get  language  behavior  from  a  computer 
program  Is  to  Include  a  set  of  fixed  responses  which  are 
appropriate  for  the  situations  It  will  encounter.  These 
responses  can  be  made  as  complex  and  varied  as  we  want,  since 
they  are  created  by  the  programmer,  and  the  program  only  repeats 
them.  Many  language  systems  make  heavy  use  of  these  fixed 
responses  to  achieve  “good  sounding"  output. 

In  general,  these  responses  are  not  flexible  enough  for  a 
true  language  system,  but  there  are  places  where  they  are 
appropriate  --  for  example,  when  the  program  wants  to 
acknowledge  that  something  has  happened,  or  that  It  has  reached 
a  certain  point  In  Its  analysis.  Our  system  uses  several  types 
of  patterned  response.  First  there  Is  a  set  of  simple  responses 


Section  4.4.1  -  Page  385 


for  specific  situations.  The  system  responds  "OK.'*  when  a 
command  Is  carried  out/  "t  UNDEftSTAND."  when  a  declarative 
sentence  Is  analyzed,  "I  DON'T  UNDERSTAND."  when  a  sentence 
cannot  be  analyzed,  and  "I  CAN'T."  when  a  command  cannot  be 
executed. 

A  slightly  more  complex  type  of  response  Involves  "filling 
In  the  blank"  with  a  phrase  borrowed  from  the  Input.  The 
simplest  example  In  our  system  Is  "SORRY,  I  DON'T  KNOW  THE  WORD 
PLEASE  TYPE  TWO  SPACES."  The  offending  word  Is  taken  as 
It  was  Input  and  Inserted  In  the  blank.  The  "two  spaces"  are  to 
clear  the  Input  buffer  of  characters  typed  after  the  message  was 
sent.  Two  slightly  more  complex  types  of  response  Involve 
manipulating  the  determiners  of  the  phrase  which  was  Input. 
the  user  types  something  like  "the  three  green  pyramids",  and 
the  system  cannot  figure  out  what  he  Is  referring  to.  It  types 
"I  DON'T  KNOW  WHICH  THREE  GREEN  PYRAMIDS  YOU  MEAN."  It  has 
simply  replaced  "the"  with  "which"  before  filling  the  blank. 

The  "I  assume"  mechanism  does  the  opposite,  replacing  an 
Indefinite  determiner  with  "the".  If  we  talk  about  "some  green 
pyramid"  or  "a  green  pyramid",  then  later  refer  to  that  pyramid 
as  "It",  the  system  can  notify  us  of  Its  Interpretation  of  "It" 
by  saying  "BY  'IT'  I  ASSUME  YOU  MEAN  THE  GREEN  PYRAMID."  Here 
the  system  has  removed  the  Indefinite  determiner  or  quantifier 
("a"  or  "some")  and  replaced  It  with  "the".  It  uses  knowledge 
about  various  kinds  of  determiners  to  fill  In  the  pattern 
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reasonably  In  a  number  of  cases  (for  example#  possessives  are 
left  alone#  since  they  are  already  definite).  It  can  also 
handle  the  use  of  "they"  to  refer  to  several  objects. 

Our  most  complex  "blank-filling"  response  Is  the  one  which 
handles  ambiguity.  It  uses  parts  of  the  Input  sentence  and 
pieces  of  the  definitions  of  words  stored  In  the  dictionary.  In 
the  response  to  sentence  24  of  the  sample  dialog  (section  1.3): 

24.  how  many  things  are  on  top  of  green 
cubes? 

the  system  typed; 

I'M  NOT  SURE  WHAT  YOU  MEAN  BY  "ON  TOP 
OF  "IN  THE  PHRASE  "ON  TOP  OF  GREEN 
CUBES  "  . 

DO  YOU  MEAN: 

1  -  DIRECTLY  ON  THE  SURFACE 

2  -  ANWHERE  ON  TOP  OF  ? 

The  tv/o  phrases  "on  top  of"  and  "on  top  of  green  cubes" 
were  taken  directly  from  the  Input#  and  the  paraphrases 
"directly  on. the  surface"  and  "anywhere  on  top  of"  were  part  of 
the  dictionary  definition  provided  for  "on".  One  of  the  answer 
routines  looks  at  the  list  of  places  where  the  Interpretations 
differ#  and  generates  this  request  for  clarification.  The 
response  must  eliminate  at  least  one  possibility#  and  the 
program  continues  generating  questions  like  this  one  until  only 
one  Interpretation  remains.  If  the  ambiguity  Is  caused  by 
multiple  Interpretations  for  a  pronoun#  the  English  text  of  the 
phrases  originally  used  to  refer  to  the  objects  are  used  In  the 
response  choices. 


Section  4.4.2  -  Page  387 


4^4 .2  Answering  (luestlonv 

In  order  to  carry  on  a  discourse#  we  need  to  know  what  types 
of  responses  people  expect  to  different  types  of  questions.  In 
Section  2.3.3  we  classified  various  types  of  questions 
syntactically#  and  we  can  use  ti.ls  classification  to  select 
appropriate  answers.  The  system  answers  whenever  possible  with 
a  complete  phrase#  providing  as  much  Information  as  possible. 

No  attempt  Is  made  to  produce  f u  1 1  sei'stences#  since  In  fact 
people  rarely  answer  questions  with  sentences.  The  natural 
response  to  "Which  block  Is  In  the  box?"  Is  "the  red  block"#  or 
"the  red  one"#  certainly  not  "The  red  block  Is  In  the  box."  We 
have  tried  to  analyze  what  sort  of  phrases  provide  the  most 
Informative  and  natural  responses. 

The  first  case  Is  the  WH-  question#  and  within  that  class# 
the  NGQCsee  section  2.3.2}#  Involving  a  question  NOUN  GROUP# 
such  as  "which  block"#  "who"#  "how  many  pyramids"#  or  "what". 

We  can  divide  these  Into  three  classes  —  "how  many"#  "vague"# 
and  "specific".  If  asked  "VIhat  Is  In  the  box?"#  we  can  answer 
"a  blue  block  and  a  pyramid"  without  being  more  specific  at 
Identifying  them.  If  Instead  we  are  asked  "VIhIch  block  Is  In 
the  box?"  we  must  use  a  more  specific  description  like  "the 
large  blue  cube  which  supports  a  pyramid."  The  program  must 
generate  English  descriptions  of  particular  objects#  and  It  must 
be  able  to  generate  both  definite  and  Indefinite  descriptions. 
This  part  of  the  program  will  be  described  In  the  next  section. 
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The  use  of  Its  results  Is  straightforward  for  NGQ  questions.  If 

¥ 

the  NG  Is  "what",  we  generate  Indefinite  descriptions  of  the 
object  or  objects.  If  It  Is  "which...",  we  generate  a  definite 
description.  "Who"  Is  never  a  problem,  since  the  system  only 
knows  of  two  people,  "you",  and  "I".  There  are  also  default 
responses,  so  that  a  question  like  "Which  block  supports  the 
table?"  can  be  answered  "NONE  OF  THEM." 

HOWMANY  questions  are  answered  with  the  number  of 
appropriate  objects,  followed  by  "OF  THEM"  to  make  the  discourse 
smoother.  For  example,  the  response  to  dialog  sentence  6,  "How 
many  blocks  are  not  In  the  box?".  Is  "FOUR  OF  THEM." 

The  next  type  of  question  Is  the  QADJ,  such  as  "why", 
"when",  "how",  or  "where".  The  only  three  which  have  been 
Implemented  so  far  are  "why"  "when",  and  "hov;",  but  the  others 
can  be  done  In  an  analogous  fashion.  A  "why"  question  Is 
answered  using  the  system's  memory  of  the  subgoals  It  used  In 
achieving  its  goals  In  manipulating  toy  objects.  If  we  can 
decide  what  event  Is  being  referred  to  In  the  question,  we  can 
see  what  goal  called  I t  as  a  subgoal,  and  answer  by  describing 
*  the  higher  goal  In  English.  If  the  event  was  Itself  a  top  level 

goal.  It  must  have  been  requested  as  a  command,  and  the  response 
Is  "BECAUSE  YOU  ASKED  ME  TO.". 

We  need  a  program  which  creates  an  English  description  of 
an  event  from  Its  PLANNER  description  like  (#PUTON  ;B3  :TABLE) . 
It  must  generate  phrases  which  refer  to  the  objects  Involved, 
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and  combine  them  Into  a  clause  of  the  proper  type  with  the 
proper  tense.  This  program  Is  also  described  In  the  next 
section. 

We  can  use  the  same  event-descr Iber  to  answer  "how" 
questions  by  describing  all  of  the  events  which  were  subgoals 
used  In  achieving  the  event  mentioned.  We  say  "BY...",  then 
list  each  event  In  an  "Ing"  form,  as  In;  "BY  PICKING  UP  A  RED 

BLOCK  AND  PUTTING  IT  IN  THE  BOX."  If  the  event  was  Itself  a 

lowest- level  goal,  the  system  has  no  way  of  analyzing  Its  own 

programs  for  achieving  that  goal,  and  answers  "I  CAN'T  ANALYZE 
HOW" . 

"When"  questions  are  answered  similarly  --  a  time  Is  named 
by  describing  the  top-level  goal  which  was  being  carried  out  at 
the  time,  saying  "WHILE  I  WAS..."  and  using  the  "Ing"  form  to 
describe  the  event.  This  Is  Inappropriate  If  the  question 
refers  directly  to  the  top-level  goal  (We  can't  answer  "When  did 
you  build  the  stack?"  with  "WHILE  I  WAS  BUILDING  THE  STACK."), 
and  In  that  case  we  say  "BEFORE..."  and  name  the  top-level  goal 
Immediately  following  In  time.  If  the  goal  mentioned  was  just 
done,  the  system  replies  "JUST  NOW.".  In  addition  to  the  normal 
responses,  the  system  has  a  set  of  fixed  responses  such  as 
"NEVER",  and  "I  CAN'T  EXPLAIN  A  NON-EXISTENT  EVENT."  to  answer 
questions  which  demand  them. 

Finally  we  come  to  YES-NO  questions  which,  paradoxically, 
are  the  most  complicated.  It  seems  that  a  one  word  answer  Is 


Section  4.4.2  -  Page  390 


called  for,  but  this  Is  often  Impossible  and  rarely  the  best  way 
to  respond.  If  we  ask  "Does  the  block  support  three  pyramids?", 
and  In  fact  It  supports  four,  what  Is  the  correct  answer?  The 
system  could  ask  for  clarification  of  the  Implicit  ambiguity 
between  "at  least  three"  and  "exactly  three",  then  answer  "yes" 
or  "no".  But  It  Is  more  efficient  and  helpful  to  answer  "FOUR 
OF  THEM",  leaving  the  speaker  to  Interpret  his  own  question. 

If  there  were  only  two  pyramids,  the  system  could  respond  "NO", 
but  It  would  be  more  Informative  to  say  "NO,  ONLY  TWO  OF  THEM". 
In  any  of  these  cases,  we  might  be  even  more  complete,  and  say 
something  like  "FOUR  OF  THEM;  A  BLUE  ONE,  TWO  RED  ONES,  AND  A 
LARGE  GREEN  ONE." 

There  are  three  parts  we  can  Include  In  the  answer  --  "yes" 
or  "no",  a  number,  and  a  description  of  the  objects.  Remember 
that  whenever  possible  a  YES-NO  question  Is  treated  like  a  NG 
question  by  looking  for  a  focus  element  (an  Indefinite  NG  In  the 
question).  A  question  like  "Does  the  pyramid  support  a  block?" 
Is  treated  logically  like  the  question  "Which  block  does  the 
pyramid  support?"  ,  or  "How  many  blocks  does  the  pyramid 
support?"  All  three  send  a  THFIND  ALL  request  to  PLANNER,  asking 
for  all  of  the  objects  fitting  the  description; 

(THPROG  (X) 

(THGOAL  (#IS  $?X  #BL0CK)) 

(THGOAL  (fSUPPORT  ;B5  $?X))) 

whe  e  :B5  Is  the  system's  Internal  name  for  the  pyramid  being 
referred  to  by  "the  pyramid"  (this  would  be  determined  In 
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separate  calls  to  PLANNER).  In  the  case  of  the  HOWMANY  question 
we  answer  with  the  number  of  objects  found.  Kor  the  "which" 
question,  we  name  the  objects.  In  the  case  of  a  YES-NO 
question,  we  answer  with  all  three  kinds  of  Information,  saying 
"YES,  TWO  OF  THEM:  A  LARGE  RED  ONE  AND  THE  GREEN  ONE." 

The  first  element  Is  "YES"  If  the  answer  Is  clearly  yes 
(for  example  If  the  number  Is  matched  exactly,  or  the  number  In 
the  original  question  was  Indefinite  as  In  this  example),  "MO" 

If  It  Is  clearly  no  (for  example  If  there  are  fewer  than 
requested,  none  at  all,  or  the  request  was  of  a  form 
"exactly..."  "at  least..."  "more  than..."  etc.  and  was  not  met), 
and  Is  omitted  If  there  Is  a  question  about  Its  Interpretation 
(as  described  above). 

The  second  element,  the  number.  Is  omitted  If  the  number 
found  matches  the  request  (For  example,  "Ar-;  there  three 
blocks?"  Is  not  answered  redundantly,  "YES,  THREE  OF  THEM;  A 
GREEN  ONE  AND  TWO  LARGE  RED  ONES.").  The  phrase  "OF  THEM" 
following  the  number  Is  changed  to  "THINGS"  If  the  focus 
contains  a  TPRON  like  "anything",  or  "something".  If  the  number 
found  Is  less  than  that  In  the  focus.  It  Is  preceded  by 
"ONLY...",  so  the  answer  comes  out  "NO,  ONLY  TWO  OF  THEM;...") 

At  the  end  of  a  response,  we  put  the  description  of  the 
objects  found,  unless  the  request  used  a  special  number  format 
such  as  "exactly...",  "at  least..."  etc.  In  which  case  the 
system  assumes  the  number  Is  'e  Important  than  the  specific 
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objects.  We  use  the  object-naming  program  In  Its  Indefinite 
mode.  If  the  focus  originally  appeared  as  the  object  of  a 
preposition,  we  repeat  that  preposition  before  the  description 
to  clarify  the  ansv/er.  Thus.  "Is  the  pyramid  on  a  block?"  Is 
answered  "YES,  ON  A  LARGE  GREEN  ONE."  The  unknown  agent  of  a 
passive  like  "Is  It  supported?"  is  Implicitly  the  object  of 
"by",  so  the  answer  Is  "YES,  BY  THE  TABLE."  If  a  YES-NO 
question  contains  no  possible  focus  since  all  of  Its  NGs  are 
DEFInIte,  as  in  "Does  the  table  support  the  box?",  the  system 
answers  simply  "YES"  or  "NO". 
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4.4.3  N.Q'n.I.Qfi  p.bleQ.t;?  and  Events 

The  previous  section  covers  all  of  the  different  types  of 
questions  the  system  can  handle#  and  the  types  of  phrases  It 
uses  In  response.  We  have  not  yet  explained  how  It  names  an 
object  or  describes  an  event.  This  Is  done  with  a  set  of 
PLANNER  and  LISP  functions  which  examine  the  data  base  and  find 
relevant  Information  sbout  objects.  These  programs  take 
advantage  of  the  fact  that  the  subject  matter  Is  limited.  In 
general/  the  way  an  object  Is  named  Is  highly  dependent  on  what 
the  person  being  spoken  to  Is  Interested  in  and  what  he  already 
knows.  This  has  not  been  dealt  with  yet.  Certain  features  of 
objects/  such  as  their  color  and  size#  are  assumed  to  be  the 
best  way  to  describe  them  In  all  contexts. 

First  we  need  to  know  how  the  object  Is  basically 
classified.  In  the  BLOCKS  world#  the  concept  *IS  represents 
this#  as  In  (#IS  ;HAND  #HAND)#  (#IS  :B1  #BL0CK)#  and  (#IS  #BLUE 
fCOLOR).  The  naming  program  for  objects  first  checks  for  the 
unique  objects  In  Its  world#  "I”#  "you"#  "the  table"#  "the  box"# 
and  "the  hand".  If  the  object  Is  one  of  these#  these  names  are 
used.  Next  It  checks  to  see  If  It  Is  a  color  or  shape#  In  which 
case  the  English  name  Is  simply  the  concept  name  without  the 
The  question  "What  shape  Is  the  pyramid?"  Is  answered 
"pointed."  since  It  has  the  shape  fPOINTED.  If  the  the  object 
Is  not  one  of  these  and  Is  not  a  fBLOCK#  #BALL#  or  a  ^PYRAMID 
the  program  gives  up.  If  It  Is  one  of  those  three#  the  correct 
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noun  Is  used  (Including  a  special  check  of  dimensions  to  see  If 
a  #BLOCK  Is  a  "cuhe")/  and  a  description  Is  built  of  Its  color 
and  size.  At  each  stage  of  building^  the  description  Is  checked 
to  see  If  It  refers  uniquely  to  the  object  being  described.  If 
so,  the  determiner  "the"  Is  put  on,  and  the  description  used 
without  further  addition.  If  there  Is  only  one  ball  In  the 
scene.  It  will  always  be  referred  to  as  "the  ball". 

If  the  description  Includes  color  and  size,  but  still  fits 
more  than  the  desired  object,  the  outcome  depends  on  whether  a 
specific  description  or  an  nonspecific  one  Is  called  for.  If  It 
Is  nonspecific,  the  program  puts  the  Indefinite  pronoun  "a"  or 
"an"  on  the  beginning  and  produces  something  like  "A  LARGE  GREEN 
CUBE".  If  It  Is  specific,  more  Information  Is  needed  If  the 
object  supports  anything,  the  program  adds  the  phrase  "WHICH 
SUPPORTS..."  then  Includes  the  English  descriptions  (Indefinite) 
of  all  the  objects  It  supports.  If  the  object  supports  nothing, 
the  program  adds  "WHICH  IS  TO  THE  RIGHT  OF..."  and  names  all  of 
the  objects  to  the  left  of  the  desired  one.  This  still  may  not 
characterize  the  object  uniquely  In  some  situations,  but  the 
system  assumes  that  It  does.  If  at  any  point  In  the  dialog,  an 
object  Is  given  a  proper  name.  It  Is  referred  to  using  only  the 
noun  and  the  phrase  "NAMFO...",  as  In  "THE  BLOCK  NAMED 
SUPEPBLOCK." 

Naming  events  Is  relatively  straightforward.  V/1  th  each 
event  type  (such  as  #PUT0N  or  #STACKUP)  we  associate  a  small 
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program  which  generates  an  English  name  for  the  event  and 
combines  It  properly  with  the  names  of  the  objects  Involved. 

For  example/  the  definition  for  tPUTON  Is; 

(APPEND  (V8FIX  (QUOTE  PUT))  OBJl  (QUOTE  (ON))  0BJ2) 

VBFIX  Is  a  program  which  puts  the  verb  Into  the  right  form  for 
the  kind  of  clause  needed  to  answer  the  question,  (for  example/ 
“Ing  for  answering  "how"/  or  Infinitive  for  answering  "why"). 

It  takes  Into  account  the  changes  In  spelling  Involved  In  adding 
endings,  OBJl  and  0BJ2  are  bound  by  the  system  to  the  English 
names  of  the  objects  Involved  In  the  event/  using  the  object- 
naming  program  described  above.  APPEND  Is  the  LISP  function 
which  puts  together  the  four  Ingredients  end  to  end.  We 
therefore  get  descriptions  like  "BY  PUTTING  A  LARGE  RED  CUBE  ON 
THE  TABLE".  There  Is  a  special  check  for  the  order  of  particles 
and  objects/  so  that  we  output  "TO  PICK  UP  THE  SMALL  BLUE 
PYRAMID."/  but/  "TO  PICK  IT  UP"  rather  than/  "TO  PICK  UP  IT". 
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4.4.4  Generating  Discourse 

The  previous  sections  described  a  generating  capability 
which  can  produce  reasonabi e  Engl  I sh  answers  to  different  types 
of  questions/  but  used  by  themselves/  the  features  described 
would  produce  awkward  and  stilted  responses  which  would  at  times 
be  Incomprehensible.  Even  though  we  have  mentioned  some 
dIscourse-1 Ike  patterns  (like  "...OF  THEM"  following  a  number)/ 
we  have  not  yet  discussed  the  real  problems  of  discourse.  The 
system  uses  three  different  discourse  devices  In  producing  Its 
answers.  These  are  much  more  limited  than  the  range  of 
discourse  features  It  can  understand/  but  they  are  sufficient  to 
produce  fluent  dialog. 

The  first  problem  Involves  lists  of  objects.  Our  Initial 
way  of  naming  more  than  one  object  Is  to  simply  string  the 
descriptions  together  with  commas  and  "AND".  We  might  end  up 
with  an  answer  like  "YES/  FOUR  OF  THEM:  A  LARGE  BLUE  BLOCK/  A 
SMALL  RED  CUBE/  A  SMALL  RED  CUBE/  AMD  A  SMALL  RED  CUBE."  To 
avoid  this  redundancy/  the  obje«- t-namer  looks  for  Identical 
descriptions  and  combines  them  with  the  appropriate  number  to 
get  "A  LARGE  BLUE  BLOCK  AND  THREE  SMALL  RED  CUBES."  (Note  that 
It  also  must  change  the  noun  to  plural). 

The  next  problem  Is  the  use  of  substitute  nouns.  We  would 
like  to  respond  to  "Is  there  a  red  cube  which  supports  a 
pyramid?"  by  "YES/  A  LARGE  ONE."  Instead  of  "YES/  A  LARGE  RED 
CUBE."  By  comparing  the  English  descriptions  of  the  objects  with 
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the  wording  of  the  focus  In  the  Input  sentence/  we  can  omit 

those  nouns  and  adjectives  they  share  end  replace  them  by  "ONE". 

The  third  problem  Is  more  serlous/  as  Ignoring  It  can  lead 

to  Incomprehensible  responses.  Consider  the  answer  to  question 

32  in  the  dialog  ("Kov;  did  you  do  It?").  If  we  did  not  use  the 

proncun  "It"  or  the  determiner  "that"/  the  response  would  be: 

BY  PUTTING  A  LARGE  RED  BLOCK  ON  THE  TABLE/  THEN  LETTING  GO 
OF  A  LARGE  RED  BLOCK/  THEN  PUTTING  A  LARGE  GREEN  CUBE  ON  A 
LARGE  RED  BLOCK/  THEN  LETTING  GO  OF  A  LARGE  GREEN  CUBE/ 

THEN  PUTTING  THE  RED  CUBE  ON  A  LARGE  GREEN  CUBE/  THEN 
LETTING  GO  OF  THE  RED  CUBE. 

How  many  different  blocks  and  cubes  are  Involved?  In  describing 
events/  we  must  have  some  way  to  Indicate  that  we  are  referring 
to  the  same  object  more  than  once.  We  can  do  this  using  "It" 
and  "that"/  and  at  the  same  time  can  use  these  words  to  Improve 
the  smoothness  of  the  discourse  In  other  ways.  The  system  has 
heuristics  which  lead  It  to  use  "It"  to  refer  to  an  object  In  an 
event  It  Is  describing  whenever:  1.  the  same  object  was  called 
"It"  In  the  question.  2.  the  object  was  called  "It"  In  the 
question  preceding  the  current  one/  and  "It"  was  not  used  In  the 
current  one.  3.  the  object  has  already  been  named  In  the 
current  answer/  and  was  the  first  object  In  the  answer  4.  no 
objects  have  yet  been  named  In  the  current  answer/  and  the 
object  was  the  only  one  named  In  the  previous  answer. 

To  refer  to  an  object  already  named  In  the  current  answer, 
other  than  the  first/  the  program  applies  the  determiner  "that" 
to  the  appropriate  noun/  to  get  a  phrase  like  "BY  PUTTING  A 
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GREEN  BLOCK  ON  A  RED  CUBE  THEN  PUTTING  THAT  CUBE  IN  THE  BOX.*' 
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iu4,tS  -Fu-ture  Development 

The  generation  of  language  is  a  complex  subject/  wide  open 
to  future  development.  Our  current  system  is  just  a  beginning, 
and  has  some  major  deficiencies.  First,  we  would  like  to 
describe  an  object  by  using  facts  which  are  relevant  to  the 
context.  In  our  simple  world,  we  have  declared  by  flat  that 
color,  size,  and  support  relationships  are  the  Important  facts 
about  an  object.  We  could  just  have  well  have  used  location  to 
get  answers  like  "the  block  nearest  to  the  back  of  the  table". 
With  a  wider  range  of  subjects,  v/e  would  need  much  more 
sophisticated  heuristics  for  deciding  what  features  of  an  object 
will  serve  best  to  Identify  It  to  the  hearer. 

Se  ond,  we  do  not  have  a  way  to  turn  an  arbitrary  PLANNER 
expression  Into  English.  Vie  can  handle  only  specific  Objects 
and  simple  events.  There  are  a  number  of  applications  for  a 
more  powerful  English  generator.  For  example.  In  case  of 
ambiguity,  we  shouldn't  have  to  Include  special  paraphrases  In 
the  definition.  The  system  should  be  able  to  look  at  the  two 
PLANNER  descriptions  and  describe  the  difference  directly  In 
Engl  1 sh. 

The  system  should  be  able  to  tell  us  more  about  Itself  and 
how  It  does  things.  If  we  ask  a  question  like  "How  do  you  build 
stacks?".  It  should'  be  able  to  look  at  Its  own  programs  and 
convert  them  to  an  English  description  like  "First  1  find  a 
space,  then  1  choose  blocks,  then  1  put  one  of  the  blocks  on 
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that  space^  then..."  PLANNER'S  structure  of  goals  and  subgoals 
Is  Ideal  as  a  subject  for  this  kind  of  description,  and  a  great 
deal  could  be  done  along  this  line.  in  a  more  speculative  vein, 
the  development  of  discourse  generators  which  could  convert  an 
Internal  logical  format  Into  natural  language  might  lead  to 
computer  essay  writers,  or  translators  which  could  understand 
the  material  they  were  working  with. 
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4.5  Comparison  with  Other  Semantic  Systems 
X  Introduction 

This  section  compares  our  semantic  system  with  tv/o  other 
models  for  the  semantics  of  natural  language.  Each  has  served 
as  the  basis  for  computer  programs  which  ''understand"  language^ 
and  we  will  consider  the  usefulness  of  the  models  for  this 
purpose  as  well  as  their  value  as  theoretical  models  of  a 
natural  process. 

The  three  models  can  be  labelled  "categorization", 
"association",  and  "procedure".  These  do  not  represent  a  cross 
section  of  semantic  theories,  but  represent  one  particular  type 
of  theory.  They  assume  that  It  Is  meaningful  to  postulate  a 
conceputal  organization  of  human  knowledge,  related  by  semantics 
to  the  linguistic  forms  used  In  expressing  thoughts.  This  sets 
them  cff  from  traditional  approaches  which  avoided  postulating 
mental Istlc  structures  and  dealt  Instead  with  extra-mental 
representations  such  as  logical  truth  conditions  or  stimulus- 
response  relationships. 

These  three  models  are  oriented  towards  viewing  language  ?s 
a  human  activity  rather  than  an  abstract  calculus  of  symbols. 
They  study  the  process  In  terms  of  human  models,  and.  take  Into 
account  the  production  and  1 ntepretat Ion  of  language.  One 
possible  reason  why  the  current  syntactic  theories  have  been 
weak  In  developing  semantic  theories  Is  their  Insistence  on  a 
"neutral"  characterization  of  the  competence  ov  a  language  user. 
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without  regard  to  the  process  carried  ou':  In  an  Intelligent 
speaker  or  hearer.  Since  semantics  Involves  the  Interaction 
between  the  structures  of  the  language  and  the  knowledge  and 
Intelligence  of  the  language  user.  It  cannot  be  understood 
without  trying  to  deal  with  this  Intelligence  directly. 
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4.5.2  Cdtegfld.z.a;.ittn  * 

The  Categorization  model  of  semantics  was  developed  In  Its 
best  known  form  by  Katz  and  Fodor  <Katz>,  and  has  been  a  part  of 
many  computer  systems  for  understanding  natural  language.  "* 

The  basic  principle  Is  a  structure  of  categories,  called 
"semantic  markers",  dividing  the  conceputal  world  much  as  the 
Dewey  decimal  classification  subdivides  the  books  In  a  library. 

The  usual  sense  of  the  word  "bachelor"  has  the  semantic  markers 
"male”,  "human",  "animate",  "physical  object",  etc.  and  the 
f I nal ' d I  St i ngu I sh 1 ng  characteristic  ("never  having  been 
married")  Is  Its  "semantic  dl st 1 ngu 1 sher". 

in  choosing  between  different  senses  of  a  word  In  a 
particular  sentence,  these  markers  are  combined  according  to 
"projection  rules".  For  example,  the  word  "colorful"  would  be 
Interpreted  In  one  sense  In  "colorful  cube",  another  In 
"colorful  party",  while  the  rules  would  Indicate  that  "a 
colorful  ball"  has  two  possible  readings. 

Further  Information  can  be  gleaned  from  the  logical 
relations  between  the  markers  such  as  the  fact  that  "male 
uncle"  Is  redundant,  while  "female  uncle"  Is  anomalous. 

Fodor  and  Katz  did  not  attempt  to  explain  the  process  of 
producing  and  understanding  language  In  terms  of  these  markers, 

f 

preferring  to  see  them  as  abstract  neutral  relationships 
underlying  the  speaker's  competence*  They  did  not  deal  In  any 
systematic  way  '‘.'Ith  those  aspects  cf  meaning  which  cannot  be 


Sectinn  U.5.2 


Par.e  40U 


dealt  with  through  this  type  of  categorization. 

♦  Nevertheless,  categorization  has  been  used  In  many  computer 

programs  for  understanding  natural  language,  to  help  choose  the 
right  meaning  for  potentially  ambiguous  words.  Schank  <Schank 

^  1959,  1970>  has  extended  the  application  of  this  theory,  using 

semantic  relationships  to  parse  sentences  conceputally. 
Associated  with  each  sense  of  a  word  Is  a  conceptualization, 
specifying  the  semantic  relationships  of  that  word  with  other 
words  In  the  same  structure.  For  example,  one  meaning  of  the 
word  "hit"  would  be  an  action  of  physical  striking,  whose 
subject  Is  a  "person",  whose  object  Is  a  "physical  object",  and 
which  has  a  possible  Instrument  of  the  category  "v/eapon".  The 
sentence  "I  hit  the  boy  v/Ith  <?  stick."  would  be  parsed  by 
noticing  words  In  categories  which  could  fill  the  roles,  and  by 
setting  up  an  appropriate  structure.  It  could  also  account  for 
the  Interpretation  In  which  "hit"  Involves  striking  with  a  fist, 
and  "with"  represents  posaess  Ion  of  a  "physical  object"  by  a 
"person",  but  this  would  be  found  only  on  "prompting"  (see 
<Schank  1970>,  p.  26). 

The  underlying  belief  Is  that  humans  make  much  use  of  this 

% 

sort  of  categorization  In  understanding  sentences,  rather  than 
doing  a  complete  syntactic  parsing.  The  sentence: 

"The  window  the  ball  the  boy  threw  hit  broke." 

Is  understood  more  easily  than; 

"The  man  the  woman  the  girl  knew  liked  died." 
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Fador  and  Gafrett  <Fodor  19Fj7>  studied  sentences  like  tnese/  and 
found  that  they  are  more  east:/  understood  when  the  categories 
associated  with  the  verbs  can  indicate  the  conceptual  structure. 
For  sentence  fragments  and  ungrammatical  utterances/  this 


ability  seems 


vital 


The  second  model  has  Its  roots  more  In  psychology,  and  the 
presence  of  "associations"  between  words  In  a  person's  mind.  It 
postulates  a  sprawling  collection  of  words  and  concepts, 
connected  tc  each  other  by  simple  links.  If  "goose"  Is 
connected  to  "quill"  and  "quill"  to  "pen"  and  "pen"  to  "Ink", 
there  Is  a  path  of  length  three  from  "goose"  to  "Ink".  These 
links  would  be  present  In  Information  such  as  "A  quill  Is  a 
foose  feather",  "Pens  can  be  made  of  quills",  etc.  The 
justification  for  this  model  Is  that  the  course  of  a  path 
describes  the  relationship  betwee.i  the  two  nodes,  and  that  Its 
length  Is  a  measure  of  their  relatedness.  The  use  of 
association  as  a  model  for  computer  language  understanding  has 
been  most  Influenced  by  the  work  of  Qullllan  <0ul Ilian  1964, 
1969>.  Information  Is  coded  Into  the  network  of  concepts  using 
several  types  of  links  (for  example,  the  clafis-subclass 
relationship  used  In  the  categorization  model).  In 
understanding  a  sentence,  a  search  Is  Initiated  through  the 
network  from  each  of  the  content  words  of  the  sentence,  to  find 
the  shortest  paths  linking  them.  The  system  uses  the 
Information  along  that  path  to  decide  what  the  sentence  Is 
about. 

It  Is  Important  to  understand  why  I  call  this  the 
"association"  model  Instead' of  the  "network"  model.  The  word 
"network"  has  been  used  to  refer  to  every  conceivable  variety  of 
data  structure.  The  semantic  markers  In  the  categorization 
model  form  a  network  ,  Schank  <Schank  1969>  refers  to  the 
output  of  his  parser  as  a  "language-free  conceptual  network". 
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recent  ,>arsers  <V/oods  1970>  are  called  "auRirented  transition 
networks'*#  while  our  parser  uses  systemic  grammar  with  "system 
networks".  Each  of  these  "newtorks"  represents  a  completely 
different  sturcture  and  use  of  data.  Saying  that  a  structure  Is 
a  "network"  Is  not  much  more  Informative  than  saying  that  It  Is 
represented  by  bits  In  a  computer  memory.  What  must  be  stated 
Is  the  way  the  netv/ork  Is  used. 

A  central  committment  of  the  association  model  Is  that 
There  Is  a  significance  to  tracing  along  the  links  frori  node  to 
node  Ignoring  their  content.  Once  a  path  Is  found,  all  sorts  of 
logical  operations  may  be  used  to  determine  Its  s 1  go  1 f 1 canct  and 
make  use  of  Its  information,  but  In  the  propagation,  .'i  tclninjm 
of  calculation  Is  done  at  each  node. 

It  Is  difficult  to  formalize  a  "minimum"  of  calculation, 
but  It  Is  important  to  have  some  understanding  of  Its 
Implications.  Any  computation  whatever,  can  be  expressed  as  a 
network  by  drawing  a  flow  chart,  with  the  blocks  of  computation 
as  nodes,  and  the  transfers  of  control  as  links.  The 
computation  then  traces  a  path  through  the  net.  It  might  seem 
that  there  Is  something  Inherently  different  between  a  program 
following  a  single  path  through  a  flowchart,  and  a  signal 
propagating  In  all  directions  through  a  net.  However  the 
difference  disappears  If  we  allow  some  sor:  of  taarallel 
processing  (for  example  the  pseudo-s tmul taraous  e/aluatlon  of 
several  paths,  as  found  In  many  simulation  languages,  and  some 
theorem-provers  such  as  new  versions  of  PLANNER  <ilewltt  1970>). 
This  Is  not  the  place  to  debate  the  merits  of  parallel  vs. 
serial  processing.  The  Important  thing  Is  to  realize  that  once 
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networks  carry  out  computations  at  each  node,  they  move  away 
from  the  association  model,  towards  a  model  of  semantics  as  a 
program. 

Some  types  of  natural  language  understanding  do  appear  to 
Involve  simple  associations.  On  hearing  the  words  "fortune"  and 
"almond",  a  person  w'll  (If  he  has  eaten  In  Chinese  restaurants) 
think  of  the  word  "cookie".  It  Is  hard  to  describe  logical 
connections  which  lead  to  such  a  quick  reaction,  and  much  more 
appealing  to  picture  a  short  association  path  between  "cookie" 
and  each  of  the  original  words.  The  model  can  also  be  used  to 
explain  the  choice  of  a  single  meaning  for  a  potentially 
ambiguous  word  In  a  sentence.  If  a  word  Is  connected  by  a  link 
to  each  of  the  con''epts  It  might  describe,  the  shortest  path  to 
the  other  words  In  the  sentence  should  be  through  the  relevant 
mean  I ng. 
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4  >5 .4  Procedure 

Me  can  call  the  model  of  semantics  used  In  our  system  the 
"procedure  model".  The  primary  organization  of  knowledge  Is  In 
a  deductive  program  with  the  power  to  combine  Information  about 
the  parsing  of  the  s  -ntf^-ice,  the  dictionary  meanings  of  Its 
words,  and  non-l Ingu I s t Ic  facts  about  the  subject  being 
discussed.  Any  relevant  bit  of  knowledge  can  Itself  be  In  the 
form  of  a  program  or  procedure  to  be  activated  at  an  appropriate 
time  In  the  process  of  understanding.  The  program  operates  on  a 
sentence  to  produce  a  representation  of  Its  meaning  In  some 
Internal  language.  In  our  case  PLANNER.  This  language  allows  the 
expression  of  a  wide  variety  of  the  aspects  of  language  -- 
logical  connectives  and  quantifiers,  time  ref "irences  (provided 
by  verb  tenses  and  modifiers),  different  sorts  of  object- 
modifier  rel atonshl ps,  types  of  object  reference  (e.g.  the 
difference  between  "the  dog"  and  "a  dog"),  etc. 

In  analyzing  a  sentence,  the  program  can  use  Information 
about  previous  sentences  In  the  discourse  and  about  the  subject 
being  discussed.  This  allows  It  to  deal  with  features  of 
language  such  as  pronoun  reference,  substitute  nouns,  the  effect 
of  discourse  on  specific  referents,  and  the  disambiguation  of 
meaning  through  knowledge  of  non-l Ingulstic  facts  (like  Bar- 
Hlllel's  classic  example  of  the  "box  In  the  pen"  <Bar-HH lel>) . 

Other  programs,  such  as  <Woods  1969>  also  use  the  procedure 
model.  These  programs  use  a  complete  syntactic  parsing  of  the 
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Input  to  provide  a  framework  from  which  the  program  can  decide 
what  aspects  of  meaning  to  deduce.  The  majority  of  the 
language-comprehending  programs  have  used  the  procedure  mode)  In 
a  simplified  form,  performing  only  a  few  elementary  types  of 
deduction  In  analysis,  and  having  an  Internal  language  tailored 
to  a  specific  application. 
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jul^  tvaluatlng  lii£  Models 

There  Is  no  single  set  of  criteria  to  Judge  the  success  of 
a  model  of  semcntlcs  or  a  computer  program  for  "comprehending 
language".  Success  Is  relative  to  the  goals  of  the  model#  and 
the  aspects  It  wishes  to  describe.  Four  criteria  seem  to  be  of 
importance  both  iri  computer  language  comprehension  and  In 
developing  a  theory  of  semantics.  These  are:  ability  to 
combine  syntax  and  meaning  ;  efficiency;  ability  to  explain 
human  performance;  and  the  ability  to  understand  language  In 
context.  These  will  be  discussed  separately. 

A.  Integrating  Syntax 

There  are  many  facets  to  the  meaning  of  an  utterance  In  a 
natural  language#  and  no  model  sheds  equal  light  on  ail  of  them. 
In  fact#  two  of  the  three  models  are  limited  to  one  part  of  the 
meaning  --  the  basic  semantic  relationships  between  the  words 
used  in  the  sentence. 

The  Fodor-Katz  version  of  the  categorization  model  does  not 
attempt  to  deal  with  the  part  of  meaning  expressed  by  the 
semantic  dlstlngulshers#  analyzing  only  those  aspects  which  can 
be  modelled  by  the  markers.  It  does  not  work  with  other  aspects 
of  meaning  such  as  tense#  mood#  and  reference  to  objects. 
Schank's  version  attempts  to  model  the  way  people  understand  a 
sentence#  describing  an  actual  parsing  process.  However  the 
conceptual  parsing  does  not  actually  find  the  "meaning".  One 
argument  for  the  model  is  human  ability  to  understand  utterances 
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in  a  foieign  language  without  a  detailed  knowledge  of  Che 
giamnar.  Only  the  meanings  of  words  are  needed  to  find  the 
language-free  conceptual  connections  (see  <Schank  1970>,  p.4). 
This  seems  a  good  parallel  to  the  type  of  understanding  done  by 
the  categorization  model.  /Anyone  w!io  has  tried  to  get  along  In 
an  unfamiliar  foreign  language  will  be  familiar  with  the  • 
following  experience.  A  friend  says  scjrethlng  which  contains 
the  foreign  language  equivalents  of  the  words  "like"/  "see"/  and 
"film".  The  foreign  visitor  knows  the  words  and  their  "word- 
concept  couplings"/  but  Is  totally  at  a  loss  In  trying  to 
respond/  since  the  sentence  may  have  been  any  one  of  a  vast  set# 
Including: 

"I  like  seeing  films."  "Have  you  s.en  any  films  you  liked?"  "I 
see  you  like  films."  "Would  you  like  to  see  a  film?"  "I  sav/  a 
film  I  liked."  "Did  you  like  seeing  the  fMm?"  etc. 

V/lthout  the  additional  meaning  provided  by  syntax/  It  Is 
impossible  to  understand  the  content  of  the  sentence.  If  the 
visitor  responds  "We  are  talking  about  a  person  who  sees  a  film 
and  likes  the  film"/  his  foreign  friend  can  rightfully  reply 
"Oh/  you  didn’t  understand."  This  problem  applies  equally  to  the 
association  model.  Finding  the  Intersection  of  signals  from  the 
nodes  "see"/  "film"/  and  "like"  might  produce  the  right 
conceptual  relationships/  but  none  of  the  additional 

V 

Information.  Neither  of  these  two  models  has  been  the  basis  for 
an  actual  question-answering  system/  since  they  do  not  deal  with 
the  ways  In  which  syntax  conveys  meaning/  and  therefore  Ignore 
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those  aspects  of  semantics  In  wl:lch  syntax  plays  a  larpe  role. 
The  procedure  model  pays  more"  attent i on  to  a  complete 
understanding  of  a  sentence^  trying  to  interweave  syntax, 
semantics,  and  deduction  in  order  to  actually  answer  a  question, 
use  a  piece  of  new  ..’formation,  or  follow  a  command. 

B.  Efficiency 

It  would  seem  reasonable  to  look  for  capabilities  of  the 
other  two  models  not  possible  for  the  procedure  model.  In  one 
sense,  this  quest  Is  a  joke.  Since  a  procedure  system  has 
Information  In  the  form  of  programs,  those  programs  can  Include 
simulations  of  any  other  model.  The  significant  question  Is  not 
what  Is  possible  theoretically,  but  what  Is  reasonable  to  do. 

A  program  could  play  a  simple  game  like  MIM  by  using 
standard  strategies  of  mlnlmax  and  look-ahead.  If  It  could  win. 
It  would  provide  a  successful  model  of  NIM  playing.  However  It 
makes  no  use  of  the  simple  winning  strategy,  and  therefore  Is  a 
bad  model  for  the  specific  game.  Similarly,  the  procedure  model 
approaches  semantics  In  a  general  way,  saying  that  every  part  of 
semantics  Involves  pov/ers  of  deduction  and  the  ability  to 
combine  Information  of  a  variety  of  types.  If  In  fact,  major 
parts  of  language  comprehension  can  be  explained  by  more 
elementary  approaches,  the  general  procedure  model  Is  not  a  good 
description  for  those  areas. 

The  Issue  at  stake  is  more  than  computer  efficiency.  Since 
we  are  modelling  a  natural  process,  the  criterion  of  "Occam's 
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razor"  appUes  just  as  In  any  other  science.  The  most 
satisfactory  explanation  Is  the  least  complex  one  which  can 
account  for  the  facts. 

The  area  of  computational  complexity  Is  barely  charted,  and 
It  Is  nearly  Impossible  to  determine  that  some  computation  Is 
Inherently  "more  complex"  or  "more  difficult"  than  another.  It 
Is  especially  dangerous  to  characterize  high-level  processes 
like  deduction,  since  a  computation  which  takes  Impossibly  long 
using  one  scheme  may  be  trivial  for  another.  However,  there  Is 
an  intuitive  sense  In  which  efficiency  can  be  Judged.  A 
procedure  system  could  handle  the  "fortune  --  almond"  example  by 
systematically  looking  through  the  things  It  knows  about 
fortunes  and  almonds,  and  using  some  sort  of  analogy  program  to 
test  for  relationships.  This  seems  clearly  more  complex  than 
the  presence  of  a  simple  association  link.  Those  who  advocate 
the  assoclatlonal  model  feel  that  there  will  be  many  such  cases 
In  which  the  deductive  process  needed  to  find  the  path  would  be 
Impossibly  torturous  and  lengthy. 

There  Is  also  a  complexity  of  syntactic  parsing.  The 
semantic  connections  might  give  clues  to  the  underlying 
structure  which  would  change  the  parsing  task  Into  simply 
checking  the  plausibility  of  the  relations,  and  cleaning  up  the 
details.  This  Is  the  approach  taken  by  both  Schank  and 

Quill  Ian.  The  example  Involving  the  boy,  ball,  and  window 

Involves  a  complex  syntactic  structure  which  could  not  be 
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handled  by  most  of  the  parsers  which  have  been  written  for 
computers#  yet  a  simple  set  of  semantic  criteria  seem  to  analyze 
it  directly  v/ithout  any  complex  syntax  rules. 

C.  Modelling  Human  Behavior 

It  is  difficult  to  find  psychological  experiments  which 
could  decide  between  one  model  and  another#  since  the  underlying 
conceptual  structure  Is  too  complex  to  isolate  a  "single 
interaction".  However#  sorne  examples  seem  to  suggest  the 
validity  of  various  models.  In  speech  communication#  people 
understand  sentence  fragments#  scattered  words#  and  blurred 
phrases  which  require  filling  in  much  of  the  meaning.  In  a 
model  requiring  a  complete  parsing#  this  would  add  a  great  deal 
of  complexity#  since  the  parser  would  have  to  know  about  the 
different  types  of  fragments  as  well  as  the  grammatical 
sentences.  A  semantic  relation  model  suggests  that  the  syntax 
is  only  used  at  the  end  of  the  process#  to  check  on  the 
conceptual  message.  If  the  syntax  is  lacking#  the  final  check 
is  gone#  but  the  basic  meaning  is  still  discoverable. 

Special  types  of  language  use# like  poetry#  puns#  and  jokes 
seem  to  involve  simple  assoc iat lonal  links.  Often  the  punch 
line  of  the  joke  comes  from  recognizing  the  inappropriateness  of 
the  link  which  was  made#  while  the  poem  conveys  meaning  by 
showing  that  a  link  Is  not  as  Irrelevant  as  it  outwardly  seems# 
but  hints  at  deeper  connections. 
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D.  Context 

One  of  the  most  Important  facts  about  language 
comprehension  Is  that  sentences  do  not  appear  In  logical 
Isolation,  but  are  always  part  of  a  context,  both  of  other 
utterances  and  of  the  situation  In  which  they  are  uttered.  Some 
semantlclsts  try  to  avoid  this  problem,  saying  It  goes  beyond 
the  proper  realm  of  semantics.  Katz  and  Fodor  believe  that 
'*...a  semantic  theory  cannot  be  expected  to  account  for  the  way 
setting  determines  how  an  utterance  Is  understood.”  (<Katz>  p. 
486)  However,  If  semantics  Is  to  be  a  study  of  the  way  language 
and  meaning  are  actually  related,  we  cannot  Ignore  the  facts. 

One  of  the  main  strengths  of  the  procedure  model  Is  Its 
ability  to  Include  all  sorts  of  knowledge  In  making  deductions 
at  any  stage  of  semantic  analysis.  The  program  can  call  on  the 
contextual  knowledge  just  as  easily  as  the  dictionary 
definitions  or  syntax.  Within  the  framework  of  the  basic 
procedure  model,  there  can  be  a  detailed  model  of  those  parts  of 
the  context  which  are  needed  for  understanding  (for  example  a 
memory  of  the  objects  which  have  been  mentioned,  so  pronouns  can 
refer  back  to  them).  The  examples  below  show  some  of  the 
problems  Involved  In  other  models  when  context  enters  into 
understanding. 

Schank  uses  the  sentence  "I  hit  the  man  with  a  stick.”  to 
Illustrate  conceputal  parsing.  (<Schank  1970>  p.  26)  Since 
"hit”  takes  an  Instrumental  of  the  type  "weapon",  the  conceptual 
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parser  first  assumes  that  the  phrase  "with  a  stick"  has  this 
meaning.  Let  us  look  at  two  possible  contexts  for  the 
sentence: 

1.  Three  men  attacked  me.  I  hit  the  man  with  a  stick. 

2.  A  man  attacked  me.  I  hit  the  man  with  a  stick. 

The  phrase  "with  a  stick"  Is  Interpreted  differently  In  the 
two  cases#  but  this  cannot  be  because  of  differing  conceptual 
relations.  The  relevant  Information  Is  that  a  phrase  like  "the 
man"  will  only  be  used  when  It  Is  clear  which  particular  man  Is 
meant#  while  "the  man  with  a  stick"  will  be  used  only  In  trying 
to  distinguish  one  particular  man  from  others. 

Another  example  used  (<Schank  1970>  p.  11)  Is  the 
disambiguation  of  the  word  "fly"  depending  on  whether  Its 
subject  Is  a  "pilot"  or  not.  If  you  know  that  Ed's  father  Is  a 
pilot#  the  sentence  "Ed’s  father  flew  to  Chicago."  should  be 
Interpreted  In  the  sense  of  "operating  a  plane".  But  there  are 
no  categorization  clues  In  the  words  "Ed"  or  "father"#  for  the 
conceptual  parser. 

It  seems  that  within  the  association  model  there  should  be 
some  way  to  make  use  of  this  Information.  If  there  were  a  node 
linked  to  "Ed"#  "father#  and  "pilot"#  then  the  network  search 
Involving  "Ed"#  "father"#  and  "fly"  would  go  through  It#  and  the 
relations  could  be  determined  from  the  path.  This  approach  Is 
deceptive#  as  adding  this  type  of  knowledge  creates  a  world  of 
false  short  paths.  This  Is  a  problem  Inherent  to  the  approach. 
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The  earlier  example  of  “goouc"  ''quill"  "pen"  "Ink"  could  be 
extended  one  place  further  from  "Ink"  to  "spot".  It  Is 
extremely  uidikely  that  a  sentence  containing  "spot"  and  "goose" 
would  actually  Involve  this  connection.  Yet  It  might  be  a  much 
shorter  path  than  the  one  actually  representing  the  connection 
In  "I  spotted  a  goose."  As  the  amount  of  knowledge  grows#  there 
will  be  a  rapidly  expanding  number  o*'  false  paths#  made  up  of 
links  which  are  Individually  very  close#  but  which  bear  no 
logical  relation  to  each  other.  Since  the  association  network 
does  not  check  the  logical  relations  of  the  links  until  after 
the  path  Is  found#  there  Is  no  immediate  way  to  sort  these  out. 

By  Including  specific  knowledge#  this  problem  Is 
exacerbated#  since  each  node  will  have  a  large  number  of  links# 
of  widely  differing  logical  types. 

There  are  various  ways  to  sneak  deduction  Into  the 
association  model#  and  for  each  simple  example#  It  Is  possible 
to  design  a  trick  which  cuts  out  the  Irrelevant  links 
(Quinlan's  distinction  between  property  and  superset  links 
<QuI11Ian  1969>  Is  an  example).  As  the  amount  of  Information  In 
the  net  Increases#  It  needs  more  ad-hoc  deductive  schemes#  and 
In  order  to  handle  language  generally,  the  association  net  will 
become  a  full-fledged  parallel  processor  using  procedures  to 
find  semantic  relations. 

In  addition  to  swamping  the  system  with  Implausible  links# 
the  association  model  can  produce  very  plausible  Incorrect 
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links.  If  the  sentence: 

"He  hated  the  landlord  so  much  he  decided  to  move  Into  the 
house  on  Harvard  St.“ 

were  given  to  a  system  like  TLC,  It  would  be  hard  to  restrain  It 
from  saying  "We  are  talking  about  a  landlord  who  owns  a  house. 
The  house  Is  located  on  Harvard  St...."  The  path  from  "landlord" 
to  "house"  will  be  as  strong  as  the  path  from  "lawyer"  to 
"client"  In  the  standard  assoc latlonal  example.  But  In  this 
case^  deduction  Is  needed  to  realize  that  the  association  Is 
wrong.  A  person  would  move  out  of  the  house  of  a  landlord  he 
hated^  not  Into  It. 

These  examples  point  out  a  serious  defect  of  non-deductive 
models.  Earler  sections  discussed  the  existence  of  areas  of 
language  comprehension  which  could  not  be  handled  without  syntax 
and  deduction.  These  examples  Indicate  that  deduction  Is 
necessary  even  for  the  tasks  for  which  the  other  models  are 
designed  —  finding  the  semantic  and  conceptual  relations 
between  the  words  In  the  sentence. 
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4.5.6  Corel usions 

Although  the  procedure  model  seems  to  account  for  much  more 
of  language  behavior  than  either  of  the  others/  there  are  some 
parts  of  language  understanding  v/here  they  are  especially 
applicable.  There  Is  no  one  aspect  of  understanding  which  can 
succeed  without  deduction/  but  the  simpler  models  can  be  of  help 
In  making  appropriate  deductions  In  many  sentences. 

In  trying  to  understand  language  at  a  deep  level/  a  system 
cannot  hope  to  simply  throw  together  the  advantages  of  these 
different  models/  but  needs  a  way  to  Integrate  them  usefully. 
This  Is  the  primary  advantage  of  the  procedure  point  of  view. 

It  Is  flexible  enough  to  make  use  of  the  other  models  In  a 
systematic  way  In  an  Integrated  system.  If  semantic  criteria 
can  simplify  parsing/  a  partial  semantic  analysis  can  be 
Included  In  the  procedure  before  syntactic  analysis.  This  could 
Involve  category  matching/  or  even  a  controlled  search  throi’gh 
some  sort  of  association  net.  These  might  be  used  as  well  In 
choosing  betv^een  meanings  of  a  word,  or  In  finding  Information 
applicable  to  a  deduction  about  the  subject  matter  In  generating 
a  response. 

The  degree  to  which  these  special  sub-models  could  be  used 
v;ould  depend  on  the  particular  application  and  the  subject  being 
dl scussed. 

A  program  to  read  poetry  or  retrieve  documents  on  the  basis 
of  vague  descriptions  of  subject  matter  may  need  a  strong 
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assr'clatlon  component.  One  written  to  answer  questions  about 
airline  schedules^  or  understand  questions  and  commands  to  a 
robot  will  be  more  procedural!/  oriented.  In  general/  a  use 
Involving  detailed  knowledge  of  any  specific  subject  can  rely  on 
deduction/  while  an  application  needing  superficial  knowledge  of 
a  wide  ran^fs  of  subjects  can  benefit  from  association/  and  will 
be  correspondingly  weak  In  Its  ability  to  give  specific 
responses. 

Specific  models  such  as  association  and  categorization  are 
subparts  of  a  model  of  language  understanding/  while  the 
procedure  model  Is  fundamentally  an  approach  to  Integrating  all 
of  the  different  sub-models  Into  a  total  semantic  theory.  It 
represents  a  point  of  view  that  no  part  of  the  process  can  be 
isolated  from  the  basic  computational  power  or  "Intelligence"  of 
the  language-understander  (whether  human  or  computer). 
Understanding  of  language/  as  well  as  other  types  of  human 
behavior/  depends  on  this  ability  to  see  the  operation  of 
Intelligence  at  every  level  of  processing. 
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Chapter  5.  Conclusions 

5.1  Teaching,  Telling  and  Learning 
One  of  the  most  Important  requirements  of  a  natural 
language  understanding  system  Is  generality.  It  should  not  be 
based  on  special  tricks  or  shortcuts  which  limit  It  to  one 
particular  subject  or  a  small  subset  of  grammar,  but  should  be 
expandable  to  really  handle  the  full  diversity  of  language.  In 
each  of  the  three  preceding  chapters  we  have  pointed  out  that 
many  approaches  to  language  understanding  are  quite  limited,  and 
have  tried  to  Illustrate  the  progression  within  each  sub-area 
towards  ..tore  general  approaches. 
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i-^Li.1  Types  Qt  Knowledge 

In  evaluating  the  flexibility  of  a  system#  we  must  consider 
the  four  different  levels  of  knowledge  It  contains. 

First#  there  Is  the  "hard  core"  which  cannot  really  be 
changed  without  remaking  the  entire  system.  This  Is  Its  "Innate 
capacity"  --  the  embodiment  of  Its  theory  of  language.  At  this 
level  we  must  deal  with  such  questions  as  whether  we  should  use 
a  top-down  transformational  parser#  a  semantic  net#  or  some 
other  approach  to  the  basic  analysis  of  a  sentence#  or  whether 
we  should  have  special  tables  of  Information  or  a  general 
notation  (such  as  the  predicate  calculus)  for  representing 
Information, 

The  second  level  of  knowledge  Is  the  complex  knowledge 
about  the  language  and  the  subject  being  discussed.  This  would 
Include  such  things  as  the  grammar  of  a  language#  or  the 
conceptual  categories  Into  which  the  speaker  divides  his  model 
of  the  world.  If  we  think  about  the  human  speaker#  this  Is  a 
type  of  knowledge  which  Is  obviously  not  Innate  (since  the 
grammar  would  be  different  for  English  and  Chinese  and  the  set 
of  concepts  used  would  differ  for  talking  about  toy  blocks  and 
talking  about  love  stories).  However  It  Is  not  something  which 
he  learns  by  being  told#  or  which  he  changes  very  easily.  Over 
a  period  of  years#  he  builds  up  a  store  of  very  complex# 
Interrelated  knowledge#  which  serves  as  a  framework  for  more 
specific  Information. 
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The  third  level  Is  our  storehouse  of  knowledge  about  the 
details  of  our  language  and  our  v;orld.  It  Includes  the  meanings 
of  words#  and  most  of  what  we  called  ''complex  knowledge"  In 
section  3.1.3.  This  would  Include  such  things  as  "A  house  built 
on  sand  cannot  stand."#  "If  you  want  to  pick  up  a  block#  first 
clear  off  Its  top."#  or  "Sunspots  cause  strange  weather.".  In 
human  terms#  this  Is  the  knowledge  we  are  continually  learning 
all  of  our  lives#  and  forms  the  bulk  of  what  we  are  taught  In 
school . 

Finally#  the  fourth  level  Is  the  set  of  specific  facts 
which  are  relevant  to  a  discussion.  This  Includes  facts  such  as 
'"Flight  342  leaves  Boston  at  noon."#  "The  red  block  Is  3  Inches 
tall."#  or  "A  banana  Is  hanging  above  the  chair.".  This  Is  the 
easiest  type  to  learn#  since  It  does  not  demand  forming  any  new 
Interrelationships.  It  Is  more  like  puting  a  new  entry  Into  a 
table  or  a  new  simple  assertion  Into  a  data  base.  There  Is  no 
sharp  distinction  between  levels  three  and  four#  but  within  any 
given  system  there  will  usually  be  two  different  ways  of 
handling  Information  corresponding  to  this  distinction.  Let  us 
look  at  the  three  areas  of  syntax#  Inference#  and  semantics#  and 
see  how  these  different  levels  of  knowledge  relate  to  language 
understanding  programs  and  the  way  they  can  learn. 
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In  syntax  It  Is  clear  that  at  the  top  level  of  knowledge 
there  will  be  a  basic  approach  to  grammar,  whether  It  be 
transformations,  pattern  matching,  or  finite  state  networks.  In 
addition,  there  must  be  some  sort  of  built  In  system  to  carry 
out  the  parsing. 

Some  programs  (such  as  the  early  translation  programs)  had 
the  grammar  built  In  as  an  Integral  part  of  the  system.  In 
order  to  add  new  syntactic  Information  It  was  necessary  to  dig 
Into  the  deepest  Innards  of  the  system  and  to  understand  Its 
details.  It  v/as  recognized  quite  early  that  this  approach  made 
them  Inflexible  and  extremely  difficult  to  change.  The  majority 
of  language  systems  have  Instead  adopted  the  use  of  a  "syntax- 
directed"  parser.  A  grammar  Is  described  by  a  series  cf  rules 
which  are  applied  by  a  uniform  parsing  procedure.  In  handling 
simple  subsets  of  English,  this  turns  grammar  Into  a  third-level 
type  of  knowledge.  We  can  add  new  single  rules  (for  example, 
adding  the  fact  that  verbs  can  have  a  modifying  adverb)  In  a  way 
similar  to  adding  words  to  a  vocabulary  —  without  worrying 
about  the  Interaction  between  rules.  This  simplicity  Is 
deceptive,  since  It  depends  on  the  simplicity  of  context-free 
grammars  for  small  subsets  of  natural  language.  Once  we  try  to 
account  for  the  complexities  of  an  entire  language  with 
something  like  a  systemic  or  transformational  grammar,  we  must 
again  pay  attention  to  the  complex  Interrelationship?  between 
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the  rules#  and  the  grammar  becomes  a  tangled  web  Into  which  any 
new  addition  must  be  carefully  fitted.  For  examples  of  the 
complexity  of  current  transformational  grammars  for  English#  see 
<<llma>.  More  recent  programs  which  use  transformat  I ona 1 
grammars  <Woods  1970>#  <Bobrow  1970>  <Thorne  1963,  1969> 
recognize  the  fact  that  syntax  Is  not  really  that  simple,  and 
adopt  a  more  Interrelated  representation  such  as  networks. 

In  our  system  we  have  used  programs  to  express  the  grammar, 
as  explained  In  chapter  2.  This  Is  not  a  return  to  the  original 
first-level  representation,  since  the  grammar  programs  are 
completely  separate  from  the  system  Itself.  One  of  the 
arguments  for  using  syntax-directed  parsers  was  that  the  grammar 
rules  could  be  expressed  In  a  uniform  way  which  did  not  depend 
on  the  details  of  the  parsing  program.  Therefore  changes  could 
be  made  more  easily  and  the  grammar  was  expandable.  By 
designing  a  special  language  for  writing  grammars,  we  can  use  a 
representation  which  Is  just  as  general  as  syntax-rule  tables, 
but  which  allows  greater  flexibility  In  designing  a  grammar,  and 
relating  It  to  semantics. 

How  difficult  Is  It  to  change  our  grammar?  For  small 
changes  (like  allowing  noun  groups  to  contain  only  a  number,  as 
In  "Are  there  any  books?  I  want  three.")  only  one  or  two 
additional  lines  of  program  would  be  needed.  For  a  more 
substantial  change  (like  adding  a  new  type  of  modifying  clause) 
we  might  need  as  many  as  a  dozen  small  additions  to  the  grammar 
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In  different  places  which  would  be  affected.  The  first  change 
could  be  done  with  little  difficulty  by  anyone  with  an 
understanding  of  PROGRAMMAR  and  section  2,3.  The  second  would 
take  a  deeper  understanding  of  how  the  grammar  Is  written/  but 
would  still  Involve  only  a  small  amount  of  programming/  and  of 
course  would  not  Involve  changing  the  basic  system  at  all.  The 
grammar  was  written  to  be  fairly  complete  and  with  expansion  In 
mind.  It  seems  flexible  enough  that  we  will  be  able  to  Include 
as  much  of  the  the  complexity  of  English  as  we  want. 

What  is  important  In  terms  of  learning  Is  that  this  Is 
level-two  knowledge  --  It  Is  the  type  of  knowledge  which  Is 
learned  once  In  a  lifetime  by  a  person  (or  computer  program)/ 
and  should  not  need  any  major  changes  after  childhood. 

The<’efore  although  It  must  be  changeable/  we  do  not  need  to 
worry  about  "quick"  learning  techniques.  If  any  learning  Is 
studied  at  this  level/  we  must  deal  In  a  sophisticated  way  with 
the  methods  used  to  learn  large  amounts  of  complex  Interrelated 
material.  Those  computer  programs  which  have  "learned"  syntax 
(<McConlogue>  <SIklossy>)  have  done  so  by  taking  such  an 
oversimplified  view  of  syntax  that  the  results  hardly  have 
significance  for  natural  language. 

At  level  three  of  our  knowledge  of  syntax/  we  have  our 
knowledge  of  particular  wordS/  their  grammatical  categories  and 
peculiarities.  We  need  to  ask/  How  easy  Is  It  to  add  new  words? 
How  much  do  we  have  to  know  about  the  grammar  to  Increase  the 
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vocabulary?  In  most  systems  there  are  a  few  words  (such  as 
*'be",  "there",  or  "than")  which  have  complex  and  unique 
grammatical  behavior.  These  are  built  Into  the  grammar 
Initially  at  level  two.  The  rest  of  the  vocabulary,  like  nouns 
and  verbs,  can  be  specified  In  a  simple  format.  Our  system  Is 
no  exception.  To  add  the  words  "cat",  "purple",  and  "walk"  to 
the  system,  we  would  only  need  to  know  the  right  abbreviations 
(from  section  2.3)  to  enter  In  LISP; 

(DEFLIST  WORD  (CAT  (NOUM  NS) )  (PURPLE(ADJ) )  (V'IALK(VB  INF  ITRNS))) 

This  says  that  "cat"  Is  a  singular  (NS)  NOUN,  "pu,*ple"  Is 
an  AOJectIve,  and  "walk"  Is  the  INFInltlve  form  of  an 
InTRaNSItIve  VerB. 

Can  we  give  this  Information  In  English?  It  would  be 
straightforward  to  add  the  right  terms  to  the  vocabulary  and  set 
up  simple  PLANNER  theorems  which  would  allow  us  to  say  "'Cat'  Is 
a  noun."  or  "'Walk'  Is  an  Intransitive  verb."  It  would  be  an 
interesting  project  to  see  how  far  this  could  be  extended.  Some 
programs  have  avoided  giving  dictionary  entries  to  these  "open 
class"  words  (like  verbs,  nouns,  and  adjectives)  and  let  the 
parser  determine  their  part  of  speech  from  context.  <Thorne 
1969>  This  approach  Is  not  generally  meaningful  for  a  complete 
language  understanding  system,  since  we  need  a  dictionary  of 
meanings.  It  could  be  used  when  adding  new  words  to  the 
system,  and  could  be  done  so  trivially  in  our  Input  prog-'ams,  by 
assigning  all  unknown  words  to  have  all  possible  "open  class" 
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grammatical  features/  then  letting  the  parser  choose  the  correct 
ones  for  the  context. 
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Inference 

In  the  cloma*n  of  Inference,  there  has  been  tremendous 
variation  In  how  different  systems  treat  knowledge.  In  the 
early  programs,  all  of  the  complex  Information  was  at  level  one 
(built  Into  the  system),  while  the  specific  facts  were  at  leve’ 
four.  As  we  have  discussed,  this  made  It  very  hard  to  modify  or 
expand  the  complex  Information  held  by  the  system.  In  the 
theorem  provers,  all  of  the  complex  Information  was  treated  at 
the  fourth  level  --  as  a  set  of  Individual  formulas  which  were 
treated  as  isolated  facts.  At  level  one,  they  have  a  uni  i'crrs 
proof  procedure  as  the  heart  of  the  system.  V.'e  have  discussed 
how  this  lack  of  Information  at  other  levels  (informat’ on  about 
the  interrelationships  between  different  theorems)  severely 
limits  this  approach.  In  our  system,  only  simple  assertions 
(such  as  "Noah  Is  the  father  of  Jafeth.",  or  "Parent-of  Is  the 
converse  of  Chlld-of.")  are  dealt  with  at  the  lowest  level.  The 
rest  of  the  knowledge  Is  In  the  form  of  PLANNER  theorems  which 
have  the  ability  to  include  Information  about  their  connections 
to  other  theorems.  Some  of  these,  such  as  the  examples  In 
section  3.1.3  about  canaries  and  thesis  evaluation,  are  at  the 
third  level,  since  they  are  not  Interwoven  Into  complex 
relationships  with  other  parts  of  the  knowledge.  Other 
theorems,  such  as  the  BLOCKS  programs  (section  3.4)  for  keeping 
track  of  a  table  full  of  objects,  are  at  level  two. 

Again  we  can  ask,  how  easy  Is  It  to  add  or  change 
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Information  at  each  of  the  levels.  At  the  two  ends#  the  answer 
is  clear.  At  the  top  we  nave  PLANNER  and  our  committment  to  Its 
kind  of  theorem-proving  procedures.  Any  change  In  this  Is  a 
major  overhaul.  At  the  bottom  level#  we  have  simple  facts  like 
"The  red  pyramid  Is  supported  by  the  green  cube."  These  are  the 
facts  which  the  system  plays  with  whenever  It  Is  conversing. 

They  can  be  changed  by  simply  telling  Information  (either  In 
English  or  PLANNER)#  and  are  changed  automatically  when  things 
happen  In  the  world  (for  example  If  we  move  the  red  pyramid). 

The  middle  levels  form  the  much  more  Interesting  problem. 

At  the . second  level  we  have  our  basic  conceptual  model  of 
the  world.  This  Includes  our  choice  of  categories  for  objects# 
ways  of  representing  actions#  time#  place#  etc.  One  of  the 
benefits  of  PLANNER  sand  of  LISP#  In  which  It  Is  embedded)  Is 
that  we  have  a  variety  of  useful  facilities  to  represent  our 
world  efficiently.  Section  3.4  described  the  BLOCKS  world#  and 
It  should  be  similarly  easy  to  define  new  worlds  of  discourse 
for  the  system  (see  below  for  examples). 

The  third  level  presents  the  most  Interesting  problems  for 
adding  new  Information  to  the  system.  It  Is  simple  to  do  so  In 
PLANNER  by  adding  new  theorems#  but  we  would  like  to  do  It  In 
English  as  well.  Of  the  previous  systems#  the  only  ones  which 
could  accept  complex  Information  In  English  were  the  theorem 
provers  which  dealt  with  It  at  the  fourth  level  (as  a  set  of 
unrelated  formulas).  In  our  sample  dialog#  we  have  some 
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examples  of  telling  the  system  simple  and  slightly  complex 
Information  In  English.  Saying  "1  like  blocks  which  are  not 

% 

red/  but  I  don't  like  anything  which  supports  a  pyramid.” 
created  two  theorems.  The  first  sayS/  "If  you  want  to  prove  I 
like  something/  prove  that  It  Is  a  block  and  that  It  Is  not 
red.”  This  Is  no  different  from  a  formula  for  any  theorem 
prover/  since  It  Is  not  related  to  the  system  In  any  complex 
way.  The  second  theorem  sayS/  "If  you  are  trying  to  prove  that 
I  like  something/  and  you  can  prove  that  It  supports  a  pyramid/ 
then  give  up.”  This  Interacts  with  the  other  goals  and  theorems/ 
but  In  a  very  specialized  way. 

Much  smarter  programs  could  be  built  to  accept  complex 
Information  and  use  It  to  actually  modify  the  PLANNER  theorems 
already  In  the  data  base.  For  example/  we  might  have  a  theorem 
to  pick  up  a  block/  but  It  falls  whenever  the  block  has 
something  on  top  of  It.  We  would  like  to  say  In  English/  "When 
you  want  to  pick  up  a  block/  first  take  everything  off  of  It.”/ 
and  have  the  system  add  this  Information  to  the  theorem  In  the 
form  of  an  additional  goal  statement  at  the  beginning.  In  order 
to  do  thIS/  the  system  must  have  not  only  a  model  of  the  world 

•  It  talks  about/  but  also  a  model  of  Its  own  behavior/  so  that  It 

can  treat  Its  own  programs  as  data  to  be  manipulated  and 
modified.  This  Is  one  of  the  most  fascinating  directions  In 
which  the  system  could  be  expanded. 

Another  Is  the  possibility  of  letting  the  system  learn  fro;ii 
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experience.  This  Is  a  complex  problem  and  can  be  dealt  with  at 
many  levels.  At  a  simplistic  level,  we  can  have  It  'Mearn" 
specific  facts.  For  example,  we  have  a  theorem  which  proves 
that  a  block  has  Its  top  clear  (by  proving  It  supports  nothing). 
As  the  last  line  of  this,  we  have  the  PLANNER  statement 
(THASSERT  (#CLEARTOP  $?X)),  which  says  that  we  should  add  to  the 
data  base  the  assertion  that  this  block  Is  clear.  If  we  then 
need  the  fact  again,  we  don't  need  to  repeat  the  deduction.  In 
a  sense  the  system  has  "learned"  this  fact,  since  It  has  been 
added  to  the  data  base  without  being  mentioned  In  the  dialog. 

But  In  another  sense.  It  hasn't  learned  any  ne\/  Information, 
since  nothing  can  be  deduced  with  this  fact  that  couldn't  have 
been  done  before  using  the  theorem  that  already  existed.  A  more 
Interesting  type  of  learning  would  be  shown  by  changing  the 
PLANNER  theorems  for  accomplishing  a  goal,  depending  on  what  had 
been  achieved  In  the  past.  For  example,  we  might  have  a  goal 
statement  with  the  recommendation  (THTBF  THTRUE)  meaning  try 
anything  you  can.  If  the  goal  Is  achieved  using  some  particular 
theorem,  we  might  have  the  system  change  the  recommendation  to 
suggest  trying  that  theorem  first.  At  a  more  advanced  stage,  we 
would  have  a  heuristic  program  which  tried  to  figure  out  why  a 
particular  chain  of  deduction  worked  or  didn't  work  In  a 
particular  case.  It  would  then  modify  the  recommendations  to 
choose  the  best  theorems  In  whatever  environments  came  up  In  the 
future.  It  might  also  recognize  the  need  for  new  theorems  In 
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some  caseS/  and  actually  build  t>^em.  This  is  perhaps  closest  to 
human  learning.  It  does  not  Involve  juggling  parameters  or 
adding  new  Isolated  bits  of  Information.  Instead  It  Involves 
figuring  out  "How  are  my  Ideas  wrong  (or  right)?"  and  "How  can  I 
change  or  generalize  them?"  It  Involves  a  kind  of  "debugging"  of 
Ideas^  and  Is  a  key  reason  for  representing  knowledge  as 
procedures. 
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5 >1.4  Semantics 

Since  semantics  Is  the  least  understood  part  of  language 
understanding.  It  Is  difficult  to  find  a  clear  body  of  "level 
one"  knowledge  on  which  to  base  a  system.  Our  system  has  a 
basic  approach  to  semantics,  explained  In  chapter  4,  but  most  of 
the  semantic  work  Is  done  at  level  two  --  the  Interrelated  group 
of  LISP  programs  for  handling  particular  semantic  jobs.  At  this 
level  we  have  two  separate  areas  of  knowledge.  The  first  Is 
knowledge  about  the  language,  and  the  way  It  Is  structured  to 
convey  meaning.  This  Includes  knowledge  such  as  "In  a  passive 
sentence,  the  syntactic  subject  Is  the  semantic  object.",  "A 
definite  noun  group  refers  to  a  particular  object  In  the  world 
model."  or  "*lt*  Is  more  likely  to  refer  to  the  subject  of  the 
previous  sentence  than  the  object."  This  Is  closely  tied  to  the 
grammar,  and  Is  about  as  hard  to  modify  as  the  grammar  programs 
themselves.  The  other  type  of  level  two  knowledge  Is  the 
network  of  "semantic  features"  described  In  section  4.2.  This 
Is  peculiar  to  the  domain  being  discussed,  and  becomes  more 
complex  as  the  range  of  discussion  Increases.  As  we  pointed 
out,  this  Is  currently  separate  from  the  network  of  "concepts" 
used  for  Inference  by  PLANNER,  but  the  two  could  be  combined. 

As  with  level  two  knowledge  In  other  areas,  this  Is  not 
something  to  be  quickly  learned  and  changed.  Our  knowledge  of 
how  language  conveys  meaning  grows  along  with  our  knowledge  of 
Its  syntactic  structure,  and  Is  just  as  seldom  modified. 
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At  the  third  level  we  have  the  bulk  of  semantic  Information 
--  the  meanings  of  Individual  words.  This  Is  the  part  which 
must  be  easy  to  change  and  expand.  As  In  most  language 
understand  I ng  systems,  this  knowledge  Is  In  the  form  of  separate 
dictionary  entries,  so  that  new  words  can  be  added  without 
changing  others.  The  definition  of  each  word  Is  a  program  In 
the  "semantic  language"  described  In  section  4.2,  and  we  gain 
great  flexibility  from  this  program  form.  The  writer  of 
semantic  definitions  does  not  have  to  be  concerned  with  the 
exact  form  of  the  grammar,  and  If  he  wants  to  enter  sample 
words,  he  can  use  a  standard  function  to  describe  them  very 
simply.  Most  words  can  be  added  by  using  the  functions  CMEANS 
and  NMEANS,  or  by  using  the  particular  simple  semantic  form 
appropriate. to  the  type  of  word  (for  example#  we  would  define 
"thirteen"  by  ((NUM  13))).  If  we  come  across  a  type  of  semantic 
problem  or  relationship  we  hadn't  anticipated,  or  which  involves 
relating  things  in  an  unusual  way,  we  can  write  a  LISP  function 
as  the  definition  of  the  word  to  perform  the  required 
operations. 

We  have  tried  to  design  our  system  so  that  It  would  be 
flexible  and  could  be  easily  adapted  to  handle  other  fields  of 
knowledge  and  to  have  a  large  vocabulary.  It  would  be  nice  to 
enter  new  definitions  in  English  Instead  of  having  to  use  the 
special  semantics  language.  In  our  sample  dialog,  the  sentence 
"A  "steeple"  Is  a  stack  which  contains  two  green  cubes  and  a 


Section  5.1.4  -  Page  437 


pyrainld."  produced  a  new  definition  for  a  noun.  This  Is  only 
possible  wh;.n  we  can  *»)tpress  the  definition  of  the  new  word  In 
terms  of  the  old  words  and  concepts.  It  Is  a  hit  deceptive  for 
a  language  understanding  system  to  allow  new  words  to  bo  added 
so  simply.  If  we  wanted  to  define  the  word  "face"/  so  that  we 
could  talk  about  the  faces  of  blocks,  the  system  would  be 
lacking  the  basic  concepts  and  relationships  necessary  to  use 
the  new  word.  This  kind  of  knowledge  Is  at  the  second  level, 
and  we  cannot  expect  to  add  It  through  a  simple  definition. 

There  must  be  a  powerful  heuristic  program  which  recognizes  the 
need  for  a  new  concept  and  which  relates  this  concept  to  the 
entire  model  of  the  world.  In  this  example.  It  would  have  to 
realize  that  a  face  Is  a  part  of  an  object,  but  Is  not  an  object 
Itself.  This  might  have  varied  consequences  throughout  the 
model,  wherever  relations  such  as  "part"  are  Involved. 

Thus  although  our  system  can  accept  definitions  of  some 
words.  It  Is  a  worthwhile  but  untried  research  project  to  design 
programs  which  will  really  be  able  to  learn  new  words  In  an 
Interesting  way.  We  believe  that  this  will  be  much  easier 
within  the  environment  of  a  problem  solving  language  like 
PLANNER,  and  that  such  programs  could  well  be  added  to  our 
system. 


a 
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5.2  Directions  for  Future  Research 
In  the  preface  wo  talked  about  using  computers  In  a  new  way. 
We  speculated  about  the  day  when  we  will  just  tell  our  computer 
what  we  want  done,  and  It  will  understand.  This  paper  has 
described  a  small  step  In  that  direction.  VIhere  Is  such 
research  leading?  VJhat  approaches  should  we  take  In  the  future? 

Vie  can  see  three  basic  directions  In  which  we  could  extend 
our  system.  First,  at  present  It  knows  only  about  a  tiny 
simplified  subject.  Second,  most  of  what  It  knows  has  to  be 
programmed,  rather  than  told  or  taught.  Finally,  we  can't  talk 
to  It  at  all!  Vie  have  to  type  our  side  of  the  conversation  and 
read  the  computer's. 

The  problem  of  widening  the  scope  of  knowledge  Involves 
much  more  than  building  bigger  memories  or  more  efficient  lookup 
methods.  If  we  want  the  computer  to  have  a  large  body  of 
knowledge,  the  Information  must  be  highly  structured.  The 
critical  Issue  Is  to  understand  the  kinds  of  organization 
needed.  One  of  the  reasons  that  our  system  Is  able  to  handle 
many  aspects  of  language  which  were  not  possible  In  earlier 
systems  Is  that  It  has  a  deep  understanding  of  the  subject  It  Is 
discussing.  There  Is  a  whole  body  of  theorems  and  concepts 
associated  with  the  words  In  the  vocabulary,  and  by  making  use 
of  this  knowledge  In  Its  question-answering  and  action.  Its 
language  behavior  Is  more  like  ours.  In  going  to  larger  areas 
of  discourse  we  cannot  give  up  this  Insistence  that  the  computer 
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must  know  what  It  is  talking  about. 

We  need  a  way  to  Integrate  large  amounts  of  heterogeneous 
knowledge  Into  a  single  system  which  can  make  use  of  it.  At  the 
same  time,  we  cannot  let  the  system  become  overburdened  and 
inefficient  by  Insisting  on  a  stifling  generality  and 
uniformity.  We  want  the  advantages  of  specialized  types  of 
knowledge  and  structure  that  can  come  from  limiting  the  subject 
to  a  small  area,  but  at  the  same  time  we  must  have  the 
flexibility  that  allows  knowledge  of  different  types  to 
Interact.  PLANNER-llke  languages  may  be  a  beginning  toward 
these  new  kinds  of  organization. 

There  are  many  different  approaches  which  can  be  taken 
towards  higher  organization  of  knowledge.  We  may  want  to  think 
in  terms  of  a  "block-structure”  of  contexts,  eacn  of  which 
carries  its  own  special  vocabulary  and  Information.  We  may 
think  of  a  network,  in  which  we  can  consider  the  "distance" 
between  two  concepts  or  words.  It  might  be  possible  to  deal 
with  a  set  of  specialized  "subroutines"  for  dealing  with 
different  kinds  of  situations.  Even  for  something  as  seemingly 
simple  as  childrens'  stories,  there  are  tremendous  complexities 
and  a  well-structured  approach  Is  necessary. 

In  section  4.1.4  we  dtsvi»-C5ed  some  of  the  ways  our  system 
could  take  advantage  of  this  large-scale  structure  of  knowledge. 
The  subject  matter  would  Influence  the  choice  of  relevant 
definitions  of  words  and  appropriate  theorems  to  be  used  In 
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deduction.  This  has  been  explored  very  little,  and  there  are 
many  possibilities  for  further  research. 

The  problem  of  learning  Is  of  great  Interest  not  only  to 
those  working  on  practical  computer  systems,  but  also  to 
psychologists  Interested  In  understanding  how  learning  takes 
place  In  other  Intelligent  systems,  such  as  people.  We  need  to 
understand  how  the  amount  of  knowledge  we  already  have  affects 
the  amount  and  the  way  we  can  learn.  Working  on  a  natural 
language  program  offers  several  advantages  for  studying  problems 
of  knowledge  and  learning.  Language  represents  a  body  of  highly 
complex  knowledge,  which  Itself  can  provide  a  rich  field  for 
learning  tasks  with  a  wide  range  of  difficulties.  Also, 
language  Is  a  major  vehicle  through  which  people  learn  about  the 
world.  In  studying  the  way  that  a  computer  could  accept  new 
Information  In  natural  language,  we  are  studying  a  key  area  In 
learning.  We  need  to  understand  the  ways  In  which  learning 
depends  on  the  organization  of  our  knowledge.  We  need  to 
explore  In  what  ways  knowing  about  Its  own  mentality  could  allow 
a  computer  to  really  learn.  This  Is  perhaps  the  most 
Interesting  possibility  for  research,  and  we  have  discussed  It 
at  length  In  Section  5.1. 

We  have  discussed  the  difficulties  involved  In  accenting 
declarative  knowledge  In  any  but  a  superficial  way.  One  of  the 
problems  most  closely  associated  with  this  Is  the  use  of  world- 
knowledge  In  understanding  declarative  sentences.  Compare  the 


Section  5.2  -  Page  441 


sentences: 

I  put  the  heavy  book  on  the  table  and  It  broke. 

I  put  the  butterfly  wing  on  the  table  and  It  broke. 

The  understardlng  of  the  referent  of  the  pronoun  ’Mt*'  must 
depend  on  the  likelihood  of  the  different  objects  breaking  or 
causing  breakage.  This  could  be  handled  by  having  the 
declarative  sentence  *' Interpreter"  try  out  both  Interpretations 
and  see  which  leads  to  more  "reasonable"  conclusions. 

Out  system  can  currently  do  this  only  If  the  knowledge  of 
the  world  needed  Is  a  specific  simple  fact  ("there  Is  no  block 
In  the  box.")  or  a  categorical  fact  ("table  can't  pick  up 
blocks.")  A  more  complex  system  Is  needed  to  accept  general 
declarative  statements  and  explore  their  consequences.  It  must 
seek  the  Interpretation  which  Is  neither  trivial  nor 
incongruous^  but  which  provides  new  Informatlc.t  as  tne  speaker 
must  have  Intended  It  to.  Contextual  factors  play  the  major 
role.  The  expectations  might  be  completely  reversed  If  the 
sentence  were  preceded  by  "The  strangest  thing  just  happenedi" 
Finally  we  have  the  problem  of  speech  communication  with 
computers.  Again  the  Issue  Is  not  one  of  more  efficient 
hardware#  but  one  of  knowledge.  Spoken  language  calls  on  the 
listener  to  fill  In  a  great  deal  from  his  own  knowledge  and 
understanding.  Words#  phrases  and  whole  Ideas  are  conveyed  by 
fragments  and  mumbles  which  often  serve  as  little  more  than  a 
clue  as  to  what  they  Intend,  The  need  for  a  truly  vertical 


Section  5.2  -  Page  442 


system  Is  ivtch  greater  for  speech  than  for  written  language. 

The  analysis  at  even  the  lowest  level  depends  on  whether  the 
result  "makes  sense."  People  can  communicate  under  conditions 
where  It  Is  nearly  Impossible  to  pick  out  Individual  words  or 
sounds  without  reference  to  meaning. 

In  our  system  we  tried  to  Integrate  the  syntactic,  semantic 
and  deductive  programs  In  a  flexible  way.  We  allow  meaning  to 
guide  the  direction  of  the  parsing.  Gur  semantic  Interpretation 
Is  guided  by  logical  deduction  and  a  rudimentary  model  of  what 
the  speaker  knows.  For  spoken  language  this  must  be  expanded. 
Perhaps  we  might  look  for  fragments  of  sentences  and  use  their 
meaning  to  help  piece  together  the  rest.  Or  possibly  we  could 
create  a  unified  system  In  which  the  deductive  portion  could 
look  at  the  context  and  propose  what  It  thought  the  speaker 
might  be  saying,  on  the  basis  of  meaning,  and  the  audible  clues 
In  the  utterance.  It  might  be  possible  to  have  a  more  multi¬ 
dimensional  analysis  In  which  prosodic  features  such  as  voice 
Intonation  could  be  used  to  recognize  Important  features  of  the 
utterance.  This  Is  not  at  all  saying  that  we  should  throw 
syntax  overboard  In  favor  of  some  sort  of  vague  relational 
structure.  Often  the  most  Important  clues  about  what  Is  being 
said  are  the  syntactic  clues.  What  Is  neeeded  Is  a  grammar 
which  can  look  for  and  analyze  the  different  types  of  Important 
patterns  rather  than  getting  tremendously  Involved  with  finding 
the  exact  details  of  structure  In  a  fixed  order.  Systemic 
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grammar  Is  a  step  In  this  direction,  and  the  use  of  programs  for 
grammars  gives  the  kind  of  flexibility  which  would  be  needed  for 
doing  this  kind  of  analysis.  It  Is  not  clear  whether  our  system 
In  Its  present  form  could  be  adapted  to  handle  spoken  language, 
but  Its  general  structure  and  the  basic  principles  of  Its 
operation  might  well  be  used. 

The  challenge  of  programming  a  computer  to  use  language  Is 
really  the  challenge  of  producing  Intelligence.  Thought  and 
language  are  so  closely  Interwoven  that  the  future  of  our 
research  In  natural  language  and  computers  will  be  neither  a 
study  of  linguistic  principles,  nor  a  study  of  "artificial” 
Intelligence,  but  rather  an  Inquiry  Into  the  nature  of 
Intelligence  Itself. 
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Appendix  A  -  Index  of  Syntactic  Features 


Underlines  Indicate  primary  description  of  feature 


-OB  148 
-0B2  148 

ACTV  116,  119,  137,  138 
/DJ  120,  121,  132,  lia 
AOJG  121,  131-132.  139 
^  ADJQ  105,  107 

ADJREL  105 

ADJUNCT  105,  1C9,  128-131 

AOV  139-141 

ADVADV  141 

AOVMEASQ  105,  107 

AOVMEASREL  105 

AGENT  116,  119,  128,  129 

AND  150,  151 

AS  131 

AUX  146 

BE  116,  137 

BINDER  IM 

BOTH  150 

BOUND  105,  109 

BUTNOT  150 

CLASF  120,  121,  124,  1^ 
CLAUSE  104-119.  121,  143, 
CLAUSEADV  141 
COMP  117-118,  123,  126-128 
COMPAR  121,  131,  132,  139 
COMPONENT  151 
COMPOUND  149-152 
COMPQ  105,  108 
CuMPREL  105,  111,  113 
DANGLING  105-107,  111,  113 
DECLARATIVE  105,  106 
DEF  123,  124,  142 
DEFPOSS  123,  127 
DEM  123,  124,  142,  145 
DET  120,  122-124,  141-143 
%  DOWNQ  105,  109 

DOWN.IEL  105,  111 

DPRT  116,  119 

EN  105,  110,  137,  138,  148 


FINITE  137 
FUTURE  133,  134 
IMPER  137 

IMPERATIVE  104,  105,  106 

INCOM  123,  125,  142 

INDEF  123,  124,  125,  142 

ING  105,  109-113,  137,  138,  148 

INGOB2  148 

IMGQ  105 

INGREL  105 

INT  116,  118,  148 

IT  116,  119 

ITRMS  116,  117,  148 

ITRNSL  116,  118,  148 

ITSUBJ  105,  113,  119 

LIST  150 

LISTA  150 

LOBJ  105,  117,  128,  129 
LOBJQ  105 
LOBJREL  105 
MAJOR  105,  106,  107 
MASS  142,  143 
148  MEASQ  105,  lOo 

MEASREL  105,  111 
,  131  MODAL  133,  134,  148 
NDET  123,  124 
NEED2  145 

NEG  123,  125,  137,  138,  142 
NFS  123,  127,  145 
NG  120-127.  128-129,  132 
,  129  NGQ  107 
NOBJ  123 
NONUM  142,  143 
NOR  150 
NOUN  120,  iii 
NPL  123,  127,  142-145 
NS  123,  127,  142-145 
NUM  120,  144 
NUMD  123, 

NUMDALONE  144 
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NUMDAN  144 
NUMOAS  144 
NUMDAT  144 
NUMDET  125 
OBJ  105,  126,  145 
OBJl  105,  112,  123 
OBJIQ  105,  108 
OBJIREL  105,  111 
OBJIUPREL  113 
OBJ2  105,  112,  123 
OBJ2Q  105,  108 
OBJ2REL  105,  111 
OBJQ  105 
OBJREL  105 

OF  123,  125,  128,  129 
OFD  142 
OFD-INCOM  142 
OFOBJ  123,  126 
OR  150 

ORD  120,  122,  IM 
PAST  133,  134,  148 
PASV  116,  119,  135,  137,  138 
POSES  123,  125,  126 
POSS  123,  126,  143,  145 
POSSDEF  145 

PREP  128,  144.  145 
PREP2  145 
PREPADV  141 

PREPG  121,  128-130 
PREPOBJ  105,  112,  123,  126,  128 

PREPQ  105,  106,  107 
PREPREL  105,  110,  111,  113,  129 

PRESENT  133,  134 

PRON  JJii. 

PRONG  123,  124,  126,  127 

PRONREL  145 

PROPN  124,  lii 

PROPNG  123,  124 

PRT  116,  118,  119,  lii 

Q  120,  128,  129,  131 


QADJ  IM 
QAUX  146 
QDET  142 

QNTFR  123,  124,  125,  142 
QUEST  123,  125,  128-132,  145 
QUESTION  105-107,  146,  148 
RELADJ  111,  113 
RELDEL  105,  111 
RELPREPG  128,  129 
REPOB  148 

REPORT  105,  112,  148 

RSNG  105,  109-113,  119,  128,  148 

RSQ  105,  109,  121 

SEC  105,  106 

SHORT  105,  107,  129 

SHORTREL  111,  113 

SU8ING  105,  112,  148 

SUBd  105,  112,  117,  123,  126,  145 

SUBJFORK  151 

SUBJQ  105,  107 

SUBJREL  105,  111 

SUBJT  116,  123 

SUBJTQ  105,  108 

SUSJTREL  105,  111,  113 

SUBQ  105 

SUBREL  105 

SUBTO  105,  112,  148 

SUBTOB  148 

SUP  121,  144 

THAN  131 

THAT  105,  112 

THERE  108,  116 

TIMl  143 

TIME  123,  143 

TIMEQ  105,  108 

TIMEREL  105,  111 

TO  105,  109-113,  137,  138,  148 

TODEL  137,  138 

TPRON  124,  132,  iM 

TPRONG  123,  124 
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TRANS  116,  117,  148 
TRANS2  116,  117,  148 
TRANS2T0Q  105 
TRANS2TOREL  105 
TRANSL  116,  117,  148 
TRANST02Q  108 
TRANST02REL  111 
UPQUEST  128,  129 
UPREL  105,  113,  128,  129 
V5PS  148 
VB  146.  148 
VBAO  141 
VBFORK  151 
VFS  148 

VG  lil-Ul/  148 
VPRT  148 

WH-  105,  106,  107 
WHRS  105,  110,  113 
WORD  139-148 
Yes-No  105,  106 
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Appendix  B  -  Sample  Parsings 

HOW  MANY  EGGS  WOULD  YOU  HAVE  BEEN  GOING  TO  USE  IN  THE  CAKE  IF 
YOU  HADN'T  LEARNED  YOUR  MOTHER'S  RECIPE  WAS  WRONG? 

(((HOW  MANY  EGGS  WOULD  YOU  HAVE  BEEN  GOING  TO  USE  IN  THE 
CAKE  IF  YOU  HADN'T  LEARNED  YOUR  MOTHER'S  RECIPE  WAS 
WRONG) 

(CLAUSE  MAJOR  QUEST  NGQUES  POLR2  ACTV  OBJIQ  TRANS) 

(((HOW  MANY  EGGS) 

(NG  QUEST  HOWMANY  NDEF  NPL  DET) 

((HOW  (QDET))  (MANY  (QDET))  (EGGS  (NOUN  NPL)))) 

(WOULD  (VB  AUX  MODAL  QAUX)) 

((YOU)  (NG  SUBJ  DEF  NS  NPL)  ((YOU  (PRON  NPL  NS  SUBJ  OBJ)))) 
((HAVE  BEEN  GOING  TO  USE)  (VG  MODAL  NAGR  (FUT  PAST  MODAL)) 

((WOULD  (VB  AUX  MODAL  QAUX)) 

(HAVE  (HAVE  VB  AUX  INF  TRANS)) 
(BEEN  (AUX  Vb  BE  EN)) 

(GOING  (VB  HRNS  ING)) 

(TO  (TO)) 

(USE  (VB  INF  TRANS  MVB)))) 

((IN  THE  CAKE)  (PREPG) 

((IN  (PLACE  PREP  PLACE)) 

((THE  CAKE) 

(NG  OBJ  DET  NS  DEF) 

((THE  (DET  NPL  NS  DEF))  (CAKE  (NOUN  NS)))))) 
((IF  YOU  HADN'T  LEARNED  YOUR  MOTHER'S  RECIPE  WAS  WRONG) 
(CLAUSE  BOUND  DECLAR  ACTV  TRANS) 

((IF  (BINDER)) 

((YOU)  (NG  SUBJ  DEF  NS  NPL)  ((YOU  (PRON  NPL  NS  SUBJ  OBJ)))) 
((HADN'T  LEARNED) 

(VG  VPL  V3PS  NEG  (PAST  PAST)) 

((HADN'T  (HAVE  VB  AUX  TRANS  PAST  VPL  V3PS  VFS  NEG)) 
(LEARNED  (VB  TRANS  REPOB  PAST  EN  MVB)))) 

((YOUR  MOTHER'S  RECIPE  WAS  WRONG) 

(CLAUSE  RSNG  REPORT  OBJ  OBJl  DECLAR  BE  INT) 

(((YOUR  MOTHER'S  RECIPE) 

(NG  SUBJ  NS  DEF  DET  POSES) 

(((YOUR  MOTHER'S) 

(NG  SUBJ  NS  DEF  DET  POSES  POSS) 

(((YOUR)  (NG  SUBJ  POSS) 

((YOUR  (PRON  NPL  NS  SUBJ  OBJ  POSS)))) 
(MOTHER'S  (NOUN  NS  POSS)))) 

(RECIPE  (NOUN  NS)))) 

((WAS)  (VG  V3PS  VFS  (PAST)) 

((WAS  (AUX  VB  BE  V3PS  VFS  PAST  MVB)))) 

((WRONG)  (ADJG  Q  COMP)  ((WRONG  (AOJ) ))))))))) ) 
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PICK  UP  ANYTHING  GREEN,  AT  LEAST  THREE  OF  THE  BLOCKS,  AND 
EITHER  A  BOX  OR  A  SPHERE  WHICH  IS  BIGGER  THAN  ANY  BRICK  OM  THE 
TABLE. 

(((PICK  UP  ANYTHING  GREEN  /,  AT  LEAST  THREE  OF  THE  BLOCKS  /,  AND 
EITHER  A  BOX  OR  A  SPHERE  WHICH  IS  BIGGER  THAN  ANY  BRICK  ON 
THE  TABLE) 

(CLAUSE  MAJOR  IMPER  ACTV  TRANS) 

(((PICK)  (VG  IMPER)  ((PICK  (VPRT  VP  INF  TRANS  MVB)))) 

(UP  (PRT)) 

((ANYTHING  GREEN  /,  AT  LEAST  THREE  OF  THE  BLOCKS  /,  AND 
EITHER  A  BOX  OR  A  SPHERE  WHICH  IS  BIGGER  THAN  ANY 
BRICK  ON  THE  TABLE) 

(NG  OBJ  OBJl  EITHER  COMPOUND  LIST  NS) 

(((ANYTHING  GREEN)  (NG  OBJ  OBJl  TPROM) 

((ANYTHING  (NS  TPRCN))  (GREEN  (ADJ)))) 
((AT  LEAST  THREE  OF  THE  BLOCKS) 

(NG  OBJ  OBJl  COMPONENT  NUMD  NUM  NPL  DET  OF) 

((AT  (AT)) 

(LEAST  (NUMD  NUMDAT)) 

(THREE  (NUM)) 

((OF  THE  BLOCKS) 

(PREPG  OF) 

((OF  (PREP)) 

((THE  BLOCKS) 

(NG  OBJ  DET  NPL  DEF) 

((THE  (DET  NPL  NS  DEF))  (BLOCKS  (NOUN  NPL)))))))) 

((A  BOX  OR  A  SPHERE  WHICH  IS  BIGGER  THAN  ANY  BRICK  ON  THE 
TABLE) 

(NG  OBJ  OBJl  COMPONENT  OR  COMPOUND  BOTH  NS) 

(((A  BOX)  (NG  OBJ  OBJl  COMPONENT  DET  NS  INDEF) 

((A  (DET  NS  INDEF))  (BOX  (NOUN  NS)))) 

((A  SPHERE  WHICH  IS  BIGGER  THAN  ANY  BRICK  OM  THE  TABLE) 
(NG  OBJ  OBJl  COMPONENT  DET  NS  INDEF) 

((A  (DET  NS  INDEF)) 

(SPHERE  (NOUN  NS)) 

((WHICH  IS  BIGGER  THAN  ANY  BRICK  ON  THE  TABLE) 

(CLAUSE  RSQ  SUBREL  BE  INT) 

(((WHICH)  (NG  RELWO  DEF  NPL)  ((WHICH  (NPL)))) 

((IS)  (VG  V3PS  (PRES)) 

((IS  (AUX  VB  BE  V3PS  PRES  MVB)))) 

((BIGGER  THAN  ANY  BRICK  ON  THE  TABLE) 

(AOJG  a  COMP  COMPAR  THAN) 

((BIGGER  (ADJ  COMPAR)) 

(THAN  (THAN)) 

((ANY  BRICK  ON  THE  TABLE) 

(NG  SUBJ  COMPAR  DET  NS  QNTFR) 

(CANY  (DET  NS  NPL  QNTFR)) 

(BRICK  (NOUN  NS)) 

((ON  THE  TABLE) 
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(PREPG  Q) 

((ON  (PREP  PLACE)) 

((THE  TABLE) 

(NG  OBJ  DET  NS  OEF) 

((THE  (DET  NPL  NS  DEF)) 

(TABLE  (NOUN  NS))))))))))))))))))))) 
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Appendix  C  -  Sample  BLOCKS  Theorems 


(DEFTHEOREM  TA-CONTAIN 

(THANTE  (X  Y  Z) 

(••AT  $?X  ?) 

fTHGOAL  (#MANIP  J?X)) 

(THGOAL  (#SUPPORT  S?Y  $?X)) 

(THOR  (THAND  (THGOAL  (#IS  S?Y  #BOX)) 
(THVSETQ  $_Z  $?Y)) 
(THGOAL  {#CONTAIN  $?Z  $?Y))) 
(THASSERT  (#CONTAIN  $?Z  $?X)))) 


(DEFTHEOREM  TC-MAKESPACE 

(THCONSE  (SURF  SIZE  OBJ  SPACE  X  (WHY  (EV))  EV) 

(#FINOSPACE  S?SURF  $?S1ZE  $?OBJ  $?SPACE) 
(THNOT  (THGOAL  (#IS  $?SURF  #BOX))) 
(MEMORY) 

TAG 

(THAND  (THGOAL  (#SUPPORT  $7Sl.'RF  $_X)) 
(THGOAL  (#GET-RID-OF  $?X) 

(THOSE  TC-GET-RID-OF>)) 
(THOR  (THGOAL  (»FINDSPACE  $?SURF 

$?SIZE 

$?OBJ 

$?SPACE> 

(THUSE  TC-FINDSPACE)) 

(THGO  TAG)) 

(MEMOREND  (#MAKESPACE  $?EV  $?SURF)))) 

(DEFTHEOREM  TC-MORE 

(THCONSE  (MEASURE  X  Y) 

(#MORE  $?MEASUPE  $?X  $?Y) 

(THVSETQ  $J-1EASURE 

(GET  $?MEASURE  (QUOTE  MEASFN))) 
(GREATER?  ($?MEASURE  $?X) 

($?MEASURE  $?Y)))) 


(DEFTHEOREM  TC-ON 

(THCONSE  (X  Y  Z) 

(#ON  $?X  $?Y) 

(THOR  (THGOAL  (#SUPPORT  $?Y  $?X)) 

(THAND  (THASVAL  $?X) 

(THGOAL  (#SUPPORT  $_Z  $?X)) 
(THGOAL  (#ON  $?Z  $?Y) 

(THUSE  TC-ON)))))) 
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(OEFTHEOREM  TC-PICKUP 

(’■HCONSE  (X  (WHY  (EV))  EV) 

(#PICKUP  $?X> 

(MEMORY) 

(THGOAL  (#GRASP  $?X)  (THOSE  TC-GRASP)) 
(THGOAL  (#RA!SEHAND) 

(THNODB) 

(THOSE  TC-RAISEHAND)) 

(MEMOREND  (#PICKOP  $?EV  $?X>))) 


(OEFTHEOREM  TCT-PICKOP 

(THCONSE  (X  EV  TIME) 

(#PICXOP  $?X  S7TIME) 

(THOR  (THAND  (THGOAL  (#P»CKOP$?EV  S?X)) 
(TIMECHK  $?EV  $?TIME)) 
(THGOAL  (#PICKOP  $?EV  $?X  'STTIME) 
(THOSE  TCTE-PICKOP))))) 


(OEFTHEOREM  TCTE-PICKOP 

(THCONSE  <X  EV  EVENT  TIME) 

(#PICKOP  $?EV  $?X  $?TIME) 

(THOR  (THANO  (THGOAL  (#PICKOP  i?EV  $?X)) 
(TIMECHK  $?EV  $?TIME))) 

(THSUCCEED)) 

(THAMONG  $?EVENT  EVENTLIST) 

(MEMO  (GET  $?EVENT  (QOOTE  TYPE)) 

(QOOTE  (#POTON  #GET-RI D-OF) ) ) 

(TIMECHK  $?EVENT  $7TIME) 

(THOR  (THGOAL  (#PUTON  $?EVENT  $7X  ?)) 

(THGOAL  (#GET-RID-OF  $7EVENT  $7X))) 

(THVSETQ  $_EV  (MAKESYM  (QUOTE  E))) 

(AND  (POTPROP  $7EV 

(POTPROP  $?EV 

(GET  $?EVENT  (QUOTE  END)) 
(QUOTE  START)) 

(QUOTE  END)) 

(POTPROP  $7EV  (QUOTE  #PICKUP)  (QUOTE  TYPE)) 
(POTPROP  $7EV  $?EVENT  (QUOTE  WHY)) 

(SETQ  EVENTLIST  (CONS  S7EV  EVENTLIST)) 

(THASSERT  (#PICKUP  $?EV  $7X)))) 

(OEFTHEOREM  TE-CONTAIN  (THERASING  (X  Y) 

(#AT  $7X  7) 

(THGOAL  (^CONTAIN  $_Y  $?X)) 
(THERASE  (#CONTAIN  S7Y  S7X)))) 
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Appendix  D  -  Sample  PROGRAMMAR  Program 
Excerpt  from  Noun  Group  Parser 


OETl 

(COND  ((ISO  H  NS)  (FQ  NS))  (T  (FQ  NPL))) 

(OR  NN  (AND  (FQ  NUMBER)  (GO  INCOM))) 

NUMBER 
(FQ  DET) 

((NQ  OF)  OF  ADJ) 

QNUM 

(dSQ  H  NONUM)  OF  NIL) 

((AND  (PARSE  NUM)  ( FO  NUM))  NIL  OF) 

((COND  ((EQ  (SM  H)  1)  (AND  (CQ  MS)  (RQ  NPL))) 

((CQ  NPL)  (RQ  NS))) 

NIL 

(NUMD) 

INCOM) 

((EQ  (CADDR  (NB  H))  (Q  NO))  ADvJ  NIL) 

OF 

((AND  (NQ  OF)  (PARSE  PREPG  OF))  SMOF  NIL) 

((EQ  (CADDR  (NB  H))  (Q  NONE))  INCOM  ADo) 

SMOF 
(FQ  OF) 

((OR  SMN  (SMNGOF)  (NOT  (POP)))  RETSM  INCOM) 

ADJ 

((PARSE  ADJ)  ADJ  CLASF  NIL) 

EPR 

((OR  (ISQ  H  SUP)  (ISQ  H  COMPAR))  NIL  REDUC) 

(FQ  ADJ) 

(AND  (EQ  (CADDAR  M)  (Q  OF)) 

(PARSE  PREPG  OF) 

(OR  SMN  (SMNGOF)  (AND  (ERT  NOUN  SMNGOF3)  (GO  FAIL))) 

(FQ  OF) 

(GO  RETSM)) 

(GO  INCOM) 

CLASF 

((OR  (PARSE  VB  ING  (CLASF))  (PARSE  VB  EN  (CLASF))  (PARSE  CLASF)) 
CLASF 
NIL 
REDUC) 

NOUN 

((PARSE  NOUN)  NIL  RED2) 

((AND  (CQ  TIME)  (NOT  (ISQ  H  TIMl)))  REDl  NIL) 

(SETQ  T1  FE) 

(COND  ((AND  (ISQ  H  MASS)  (OR  (CQ  PART)  (NOT  (CQ  DET)))) 

(FQ  MASS))) 

(COND  ((MOT  (ISQ  H  NPL))  (RQ  NPL  PART))) 

(COND  ((NOT  (ISQ  H  NS))  (RQ  NS))) 

(COND  ((AND  (NOT  (CQ  DET))  (NOT  (CQ  NUMD)))  (*  H) 

(TRNSF  NPL  MASS))) 
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((MEET  FE  (Q  (NS  NPL  PART  MASS)))  NIL  REDO) 

SMNG 

((OR  (NOT  (CQ  OFOBJ))  (CQ  DET))  NIL  FAIL) 

(OR  SMN  (SMNGl)  (AND  (ERT  NOUN  SMNGl)  (GO  FAIL))) 
((NOT  (ISQ  H  POSS))  NIL  POSS  RETSM) 

((AND  (NEXTW  TO) 

(MEET  FE  (Q  (COMP  SUBJ))) 

(PARSE  CLAUSE  RSO  TO 

(OR  SMN  (SMRSQ  (SM  H)  SM)  (GO  POPRET))) 

RETSM 

NIL) 

((AND  (OR  (NEXTW  AS)  (NQ  COMPAR)) 

(PARSE  ADJG  THANNEED) 

(OR  SMN 

(AND  (NULL  N) 

(CQ  SUBJ) 

(ISQ  (*  C  PV)  AUX) 

(ISQ  PT  BE) 

(GO  POPRET)) 

(SMPREP  (QUOTE  ADJ)) 

(GO  POPRET))) 

RETSM 

NIL) 


Apperiydl  X 

/ 


Appendix  E  -  Sample  Dictionary  Entries 


Syntactic  Definitions 

(def 1  I  St  word 

(LITTLE  (ADJ)) 

(LONG  (ADJ)) 

(MAKE  (VB  INF  TRANS)) 

(MOVE  (VB  INF  TRANS)) 

(NAME  (NOUN  NS  VB  INF  TRANS)) 
(NARROV>/  (ADJ)) 

(NICE(ADJ)) 

(OBJECT  (NOUN  NS)) 

(OFF  (PRT)) 

(OUT  (PRT)) 

(PICK  (VPRT  VB  INF  TRANS)) 

(PUT  (INF  PAST  VB  TRANSL  VPRT)) 
(POINTED  (ADJ)) 

(PYRAMID  (NOUN  NS)) 

(RED  (ADJ)) 

(RELEASE  (VB  TRANS  INF)) 

(RIGHT  (NOUN  NS)) 

(ROUND  (ADJ))) 

(DEFLIST  WORDl 

(BEGAN  (BEGIN  (PAST)  (INF))) 

(GAVE  (GIVE  (PAST)  (INF))) 

(SAW  (SEE  (PAST)  (INF))) 

(TOLD  (TELL  (PAST)  (INF)))) 
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Semantic  Definitions 


(DEFLIST  SMNTC 

(A  ((DET  T))) 

(ABOVE  ((PREP  ((T  (#LOC  #ABOVE  T)))))) 

(AFTER  ((BINDER  (END  NIL)))) 

(ALL  ((DET  (COND  ((CQ  OF)  (Q  ALL)) 

((MEET  (Q  (NUM  DEF))  FE)  (Q  DEF)) 

((Q  NDET)))))) 

(BALL  ((NOUN  (NMEANS  ((#MANIP  # ROUND) 

((#IS  ***  #BAl.L))))))) 

(BIG  ((MEASURE  ((#SIZE  (fPHYSOB)  T))) 

(ADJ  (NMEANS  ((#PHYSOB  #810 

((#MORE  #SI2E  ***  (200  200 

200)))))))) 

(BLACK  ((ADJ  (#COLOR  #BLACK)))) 

(BLOCK  ((NOUN  (NMEANS  ((#MANIP  #RECTANGULAR) 

((#IS  #BLOCK))))))) 

(BLUE  ((ADJ  (#COLOR  #BLUE)))) 

(BY  ((PREP  ((T  (CMEANS  ( ( ( ( #PHYSOB) )  ((#PHYSOB))) 

(#NEXTO  #1  #2  •TIME) 

NIL)))))))  ! 

(COLOR  ((NOUN  (NMEANS  ((#COLOR)  ((#1$  #COLOR) ) ) ) ) ) ) 

(CONTAIN  ((VB  ((TRANS  (CMEANS  ((((#EOX))  ((#PHYSOB))) 

(#CONTAIN  #1  #2  *TIME)  ' 

NIL) 

((<(#CONSTRUCT)) 

((#THIN6))) 

(#PART  #2  #1  *TIME) 

NIL))))))) 

(CUBE  ((NOUN  (NMEANS  ((#MANIP  #RECTANGULAR)  * 

((#IS  #BLOCK)  ] 

(#EQDIM  ••*)))))))  i 

(EVERYTHING  ((TPRON  (QUOTE  ALL))))  I 

(FEWER  ((NUMD  (LIST  (Q  <)  NUM))))  j 

(FOUR  ((NUM  4)))  i 

(FRIEND  ((NOUN  (NMEANS  ((#PERSON>  j 

((#IS  •**  fPERSON))))))) 

(GRAB  ((VB  ((TRANS  (#GRASP) ) ) ) ) ) 

(GRASP  ((VB  ((TRANS  ( #GRASP) ) ) ) ) ) 

(I  ((PRON  (SETQ  SM  (Q  (FRIEND)))))) 

(IT  ((PRON  (SMIT  (Q  IT))))) 

(NICE  ((ADJ  (NMEANS  ((#THING) 

((#LIKE  :FRIEND  ***))))))) 

(NOW  ((ADV  (OR  (EQ  (CADR  (ASSQ  (QUOTE  TIME)  FE))  .  | 

(QUOTE  ;NOW))  ! 

(ERT  NOW  DEFINITION)))))  ! 
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(THE  ((DET  T))) 

(THEY  ((PRON  (SMIT  (Q  THEY))))) 

(THICK  ((AD*J  (NMEANS  ((#PHYSOB)  ((#MORE  #THICKNESS 

(0  200  0)))))) 

(MEASURE  ((^THICKNESS  (#PHYS0B)  T))))) 

(THIN  ((ADd  (NMEANS  ((#PHYSOB)  ((#MORE  #THICKNESS 

(0  200  0) 
***))))) 

(MEASURE  ((#THICKNESS  (#PHYSOB)  NIL))))) 

(THING  ((NOUN  (NMEANS  ((#THING  #VAGUE  #PHYSOB) 

((#IS  ***  ?))))))) 

(WHO  ((PRONREL  (SETQ  SM  (QUOTE  (ANIMATE)))) 

(PRON  (SETQ  SM  (QUOTE  (ANIMATE)))))) 

(WHY  ((QADJ  (FQ  v/HY))))) 


Appendix  F  -  Page  457 


Appendix  F  -  PLANNER  Data  for  Dialog  In  Section  1.3 


(#IS  :B1  #BLOCK) 

(#iS  ;82  ^PYRAMID) 

(#iS  :B3  tBLOCK) 

(#IS  :B4  ^PYRAMID) 

(#IS  ;B5  #PYRAMID) 

(#IS  :B6  #BLOCK) 

(#IS  :B7  #BLOCK) 

(#IS  ;B10  #BL0CK) 

(#IS  #RED  #COLOR) 

(#IS  #BLUE  #COLOR) 

(#IS  #GREEN  #C0L0R) 

(#IS  fWHiTE  #COLOR) 

(#IS  #BLACK  #C0L0R) 

(#IS  #RECT ANGULAR  #SHAPE) 
(#IS  #R0UN0  #SHAPE) 

(#IS  #POINTED  #SHAPE) 

(#IS  :SHRDLU  #R0B0T) 

(#IS  ;FRIEND  #PERSON) 

(#IS  :HAND  #HAND) 

(#AT  ;B1  (100  100  0)) 

(#AT  ;B2  (100  100  100)) 
(#AT  :B3  (400  0  0)) 

(#AT  ;B4  (640  640  1)) 

(#AT  :B5  (500  100  200)) 
(#AT  :B6  (0  ^00  0)) 

(#AT  :B7  (0  240  300)) 

(#AT  ;B10  (300  640  0)) 
(#SUPP0RT  ;B1  :B2) 
(#SUPP0RT  :B3  :B5) 
(#SUPP0RT  ;B6  :B7) 
(#CLEART0P  :B2) 

(#CLEART0P  :b4) 

(#CLEART0P  :B5) 

(#CLEART0P  ;B7) 

(#CLEART0P  ;B10) 

(#MANIP  :B1) 

(#MANIP  ;B2) 

(#MANIP  ;B3) 

(#MANIP  ;B4) 

(#MANIP  :BS) 


(#MAdlP  :B6) 

(#MANIP  :B7) 

(#MANIP  :B10) 

(#SUPP0RT  :TABLE  ;B1) 
(#SUPP0RT  :TABLE  ;B3) 
(#SUPP0RT  :B0X  :B4) 
(#SUPP0RT  :TABLE  ;B10) 
(^SUPPORT  .'TABLE  ;B6) 
(^SUPPORT  .'TABLE  .'BOX) 

(#AT  :BOX  (600  600  0)) 

(#IS  ;B0X  #B0X) 

(#IS  :TABLE  #TABLE) 
(#C0NTAIN  ;BOX  :B4) 

(#SHAPE  ;B1  #R£CTANGULAR) 
(#SHAPE  ;B3  #RECTANGULAR) 
(#SHAPE  ;B2  fPOINTED) 
(#SHAPE  :B4  ^POINTED) 
(#SHAPE  :B5  ^POINTED) 
(#SHAPE  :B6  #RECTANGULAR) 
(#SHAPE  :B7  #RECTANGULAR) 
(#SHAPE  :B10  #RECTANGULAR) 
(#COLOR  ;B1  #RED) 

(#C0LOR  :B2  #GREEN) 

(#C0LOR  .'B3  #GREEN) 

(#C0LOR  :B4  #BLUE) 

(#COLOR  ;B5  #RED) 

(#COLOR  ;B6  #RED) 

(#C0LOR  ;B7  #GREEN) 

(#C0LOR  :B10  #BLUE) 

(#COLOR  ;B0X  #WHITE) 
(♦COLOR  ;TABLE  #BLACK) 
(♦CALL  ISHROLU  SHRDLU) 
(♦CALL  iFRiENO  YOU) 


i 

1 

i 

I 


Some  of  the  data  Is  entered  InltldUy.  The  rest  can  be 
deduced  and  asserted  by  simple  antecedent  theorems. 
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